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iMMU^Q<aEM€::CQMP'QS-ITiaMS FOR GRAM POSITIVE BACTERIA SUCH AS 

STREPTOCOCCUS A GALA CTIAE 



FIELD OF THE INVENTION 

5 The invention relates to the identification of adhesin islands within the genome Streptococcus 

agalactiae ("GBS") and the use of adhesin island amino acid sequences encoded by these adhesin 
islands in compositions for the treatment or prevention of GBS infection. Similar sequences have 
been identified in other Gram positive bacteria. The invention further includes immunogenic 
compositions comprising adhesin island amino acid sequences of Gram positive bacteria for the 

10 treatment or prevention of infection of Gram positive bacteria. Preferred immunogenic compositions 
of the invention include an adhesin island surface protein which may be formulated or purified in an 
oligomeric or pilus form. 
BACKGROUND OF THE INVENTION 

GBS has emerged in the last 20 years as the major cause of neonatal sepsis and meningitis 

15 that affects 0.5 - 3 per 1000 live births, and an important cause of morbidity among older age groups 
affecting 5-8 per 100,000 of the population. Current disease management strategies rely on 
intrapartum antibiotics and neonatal monitoring which have reduced neonatal case mortality from 
>50% in the 1970's to less than 10% in the 1990's. Nevertheless, there is still considerable morbidity 
and mortality and the management is expensive. 15 - 35% of pregnant women are asymptomatic 

20 carriers and at high risk of transmitting the disease to their babies. Risk of neonatal infection is 

associated with low serotype specific maternal antibodies and high titers are believed to be protective. 
In addition, invasive GBS disease is increasingly recognized in elderly adults with underlying disease 
such as diabetes and cancer. 

The "B" in "GBS" refers to the Lancefield classification, which is based on the antigenicity of 

25 a carbohydrate which is soluble in dilute acid and called the C carbohydrate. Lancefield identified 13 
types of C carbohydrate, designated A to O, that could be serologically differentiated. The organisms 
that most commonly infect humans are found in groups A, B, D, and G. Within group B, strains can 
be divided into at least 9 serotypes (la, lb, Ia/c, II, III, IV, V, VI, VII and VIII) based on the structure 
of their polysaccharide capsule. In the past, serotypes la, lb, II, and III were equally prevalent in 

30 normal vaginal carriage and early onset sepsis in newborns. Type V GBS has emerged as an 

important cause of GBS infection in the USA, however, and strains of types VI and VIII have become 
prevalent among Japanese women. 

The genome sequence of a serotype V strain 2603 V/R has been published (See Tettelin et al 
(2002) Proc. Natl Acad. Set USA, 10.1073/pnas. 182380799) and various polypeptides for use a 

35 vaccine antigens have been identified (WO 02/34771). The vaccines currently in clinical trials, 

however, are based primarily on polysaccharide antigens. These suffer from serotype-specificity and 
poor immunogenicity, and so there is a need for effective vaccines against S.agalactiae infection. 
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JJ...J, |j...„ ^£*gj]<^ positive bacterium, a collection of about 21 genera of 

bacteria that colonize humans, have a generally spherical shape, a positive Gram stain reaction and 
lack endospores. Gram positive bacteria are frequent human pathogens and include Staphylococcus 
(such as S. aureus), Streptococcus (such as S. pyogenes (GBS), S. pyogenes (GAS), S. pneumonaie, S. 
5 mutans), Enterococcus (such as E.faecalis and E.faecium), Clostridium (such as C. difficile), Listeria 
(such as L. monocytogenes) and Coiynebacterium (such as C. diphtheria). 

It is an object of the invention to provide further and improved compositions for providing 
immunity against disease and/or infection of Gram positive bacteria. The compositions are based on 
the identification of adhesin islands within Streptococcal genomes and the use of amino acid 
10 sequences encoded by these islands in therapeutic or prophylactic compositions. The invention 

further includes compositions comprising immunogenic adhesin island proteins within other Gram 
positive bacteria in therapeutic or prophylactic compositions. 
SUMMARY OF THE INVENTION 

Applicants have identified a new adhesin island, "GBS Adhesin Island 1", "AM" or "GBS 
15 AM", within the genomes of several Group B Streptococcus serotypes and isolates. This adhesin 
island is thought to encode surface proteins which are important in the bacteria's virulence. In 
addition, Applicants have discovered that surface proteins within GBS Adhesin Islands form a 
previously unseen pilus structure on the surface of GBS bacteria. Amino acid sequences encoded by 
such GBS Adhesin Islands may be used in immunogenic compositions for the treatment or prevention 
20 of GBS infection. 

A preferred immunogenic composition of the invention comprises an AM surface protein, 
such as GBS 80, which may be formulated or purified in an oligomeric (pilus) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. Electron micrographs depicting some of the 
first visualizations of this pilus structure in a wild type GBS strain are shown in Figures 16, 17, 49, 
25 and 50. In addition, Applicants have transformed a GBS strain with a plasmid comprising the AI 
surface protein GBS 80 which resulted in increased production of that AI surface protein. The 
electron micrographs of this mutant GBS strain in Figures 13 - 15 reveal long, hyper-oligomeric 
structures comprising GBS 80 which appear to cover portions of the surface of the bacteria and stretch 
far out into the supernatant. These hyper-oligomeric pilus structures comprising a GBS AI surface 
30 protein may be purified or otherwise formulated for use in immunogenic compositions. 

GBS AM comprises a series of approximately five open reading frames encoding for a 
collection of amino acid sequences comprising surface proteins and sortases ("AM proteins"). 
Specifically, AM includes polynucleotide sequences encoding for two or more of GBS 80, GBS 104, 
GBS 52, SAG0647 and SAG0648. One or more of the AM polynucleotide sequences may be 
35 replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 
or more of the AM open reading frames may be replaced by a sequence having sequence homology 
(sequence identity) to the replaced ORF. 
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jp p *3^I-3L" ^jy^isg 11 !^ ^jpi^ e g:^^i^"P5^^|P matel y 16.1 kb transposon-like element frequently 
inserted into the open reading frame for trtnA. One or more of the AI-1 surface protein sequences 
typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. The AI surface proteins of the invention may affect the ability of the GBS bacteria to adhere to 
5 and invade epithelial cells. AI surface proteins may also affect the ability of GBS to translocate 

through an epithelial ceil layer. Preferably, one or more AI surface proteins are capable of binding to 
or otherwise associating with an epithelial cell surface. AI surface proteins may also be able to bind 
to or associate with fibrinogen, fibronectin, or collagen. 

The sortase proteins are thought to be involved in the secretion and anchoring of the LPXTG 
10 containing surface proteins. AI-1 may encode at least one surface protein. Alternatively, AI-1 may 
encode at least two surface proteins and at least one sortase. Preferably, AI-1 encodes for at least 
three surface proteins and at least two sortases. One or more of the surface proteins may include an 
LPXTG motif or other sortase substrate motif. 

The GBS AI-1 protein of the composition may be selected from the group consisting of GBS 
15 80, GBS 104, GBS 52, SAG0647 and SAG0648. GBS AI-1 surface proteins GBS 80 and GBS 104 
are preferred for use in the immunogenic compositions of the invention. 

In addition to the open reading frames encoding the AI-1 proteins, AI-1 may also include a 
divergently transcribed transcriptional regulator such as araC (i.e., the transcriptional regulator is 
located near or adjacent to the AI protein open reading frames, but it transcribed in the opposite 
20 direction). It is believed that araC may regulate the expression of the GBS AI operon. (See Korbel et 
al., Nature Biotechnology (2004) 22(7): 91 1 - 917 for a discussion of divergently transcribed 
regulators in E. coli). 

A second adhesin island, "Adhesin Island-2", "AI-2" or "GBS AI-2", has also been identified 
in numerous GBS serotypes. Amino acid sequences encoded by the open reading frames of AI-2 may 

25 also be used in immunogenic compositions for the treatment or prevention of GBS infection. 

GBS AI-2 comprises a series of approximately five open reading frames encoding for a 
collection of amino acid sequences comprising surface proteins and sortases. Specifically, AI-2 
includes open reading frames encoding for two or more of GBS 67, GBS 59, GBS 150, SAG1405, 
SAG1406, 01520, 01521, 01522, 01523, 01523, 01524 and 01525. The GBS AI-2 sequences may be 

30 divided into two subgroups. In one embodiment, AI-2 includes open reading frames encoding for two 
or more of GBS 67, GBS 59, GBS 150, SAG1405, and SAG1406. This collection of open reading 
frames may be generally referred to as GBS AI-2 subgroup 1 . Alternatively, AI-2 may include open 
reading frames encoding for two or more of 01520, 01521, 01522, 01523, 01523, 01524 and 01525. 
This collection of open reading frames may be generally referred to as GBS AI-2 subgroup 2. 

35 One or more of the AI-2 open reading frame polynucleotide sequences may be replaced by a 

polynucleotide sequence coding for a fragment of the replaced ORR Alternatively, one or more of 
the AI-2 open reading frames may be replaced by a sequence having sequence homology (sequence 
identity) to the replaced ORF. 
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ip, ^ne ( ,or ^^r^^^e t J^2^^^j^ptems typically include an LPXTG motif (such as LPXTG 

(SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are thought to be involved 
in the secretion and anchoring of the LPXTG containing surface proteins. AI-2 may encode for at 
least one surface protein. Alternatively, AI-2 may encode for at least two surface proteins and at least 
5 one sortase. Preferably, AI-2 encodes for at least three surface proteins and at least two sortases. One 
or more of the surface proteins may include an LPXTG motif. 

The AI-2 protein of the composition may be selected from the group consisting of GBS 67, 
GBS 59, GBS 150, SAG1405, SAG1406, 01520, 01521, 01522, 01523, 01523, 01524 and 01525. AI- 
2 surface proteins GBS 67, GBS 59, and 01524 are preferred AI-2 proteins for use in the 
10 immunogenic compositions of the invention. GBS 67 or GBS 59 is particularly preferred. 

GBS AI-2 may also include a divergently transcribed transcriptional regulator such as a RofA 
like protein (for example rogB). As in AI-1, rogB is thought to regulate the expression of the AI-2 
operon. 

The GBS AI proteins of the invention may be used in immunogenic compositions for 
15 prophylactic or therapeutic immunization against GBS infection. For example, the invention may 
include an immunogenic composition comprising one or more GBS AI-1 proteins and one or more 
GBS AI-2 proteins. 

The immunogenic compositions may also be selected to provide protection against an 
increased range of GBS serotypes and strain isolates. For example, the immunogenic composition 

20 may comprise a first and second GBS AI protein, wherein a full length polynucleotide sequence 
encoding for the first GBS AI protein is not present in a genome comprising a full length 
polynucleotide sequence encoding for the second GBS AI protein. In addition, each antigen selected 
for use in the immunogenic compositions will preferably be present in the genomes of multiple GBS 
serotypes and strain isolates. Preferably, each antigen is presnt in the genomes of at least two (i.e., 3, 

25 4, 5, 6, 7, 8, 9, 10, or more) GBS strain isolates. More preferably, each antigen is present in the 
genomes of at least two (Le. 9 at least 3, 4, 5 or more) GBS serotypes. 

Within GBS AI-1, Applicants have found that Group B Streptococcus surface exposure of 
GBS 104 is dependent on the concurrent expression of GBS 80. It is thought that GBS 80 is involved 
in the transport or localization of GBS 104 to the surface of the bacteria. The two proteins may be 

30 oligomerized or otherwise chemically or physically associated. It is possible that this association 

involves a conformational change in GBS 104 that facilitates its transition to the surface of the GBS 
bacteria. In addition, one or more AI sortases may also be involved in this surface localization and 
chemical or physical association. Similar relationships are thought to exist within GBS AI-2. The 
compositions of the invention may therefore include at least two AI proteins, wherein the two AI 

35 proteins are physically or chemically associated. Preferably, the two AI proteins form an oligomer. 
Preferably, one or more of the AI proteins are in a hyper-oligomeric form. In one embodiment, the 
associated AI proteins may be purified or isolated from a GBS bacteria or recombinant host cell. 
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^, }{ .Jps^lso^u^^^ to provide further and improved compositions for 

providing prophylactic or therapeutic protection against disease and/or infection of Gram positive 

bacteria. The compositions are based on the identification of adhesin islands within Streptococcal 

genomes and the use of amino acid sequences encoded by these islands in therapeutic or prophylactic 

5 compositions. The invention further includes compositions comprising immunogenic adhesin island 

proteins within other Gram positive bacteria in therapeutic or prophylactic compositions. Preferred 

Gram positive adhesin island proteins for use in the invention may be derived from Staphylococcus 

(such as S. aureus), Streptococcus (such as S. agalactiae (GBS), S. pyogenes (GAS), S. pneumonaie, 

S. mutans), Enterococcus (such as E.faecalis andE.faecium), Clostridium (such as C difficile), 

10 Listeria (such as L. monocytogenes) and Corynebacterium (such as C. diphtheria). Preferably, the 

Gram positive adhesin island surface proteins are in oligomeric or hyperologimeric form. 

For example, Applicants have identified adhesin islands within the genomes of several Group 

A Streptococcus serotypes and isolates. These adhesion islands are thought to encode surface proteins 

which are important in the bacteria's virulence, and Applicants have obtained the first electron 

15 micrographs revealing the presence of these adhesin island proteins in hyperoligomeric pilus 

structures on the surface of Group A Streptococcus. 

Group A Streptococcus is a human specific pathogen which causes a wide variety of diseases 

ranging from pharyngitis and impetigo through life threatening invasive disease and necrotizing 

fasciitis. In addition, post-streptococcal autoimmune responses are still a major cause of cardiac 

20 pathology in children. 

Group A Streptococcal infection of its human host can generally occur in three phases. The 

first phase involves attachment and/or invasion of the bacteria into host tissue and multiplication of 

the bacteria within the extracellular spaces. Generally this attachment phase begins in the throat or 

the skin. The deeper the tissue level infected, the more severe the damage that can be caused. In the 

25 second stage of infection, the bacteria secretes a soluble toxin that diffuses into the surrounding tissue 

or even systemically through the vasculature. This toxin binds to susceptible host cell receptors and 

triggers innappropropriate immune responses by these host cells, resulting in pathology. Because the 

toxin can diffuse throughout the host, the necrosis directly caused by the GAS toxins may be 

physically located in sites distant from the bacterial infection. The final phase of GAS infection can 

30 occur long after the original bacteria have been cleared from the host system. At this stage, the host's 

previous immune response to the GAS bacteria due to cross reactivity between epitopes of a GAS 

surface protein, M, and host tissues, such as the heart. A general review of GAS infection can be 

found in Principles of Bacterial Pathogeneis, Groisman ed., Chapter 15 (2001). 

In order to prevent the pathogenic effects associated with the later stages of GAS infection, an 

35 effective vaccine against GAS will preferably facilitate host elimination of the bacteria during the 

initial attachment and invasion stage. 

Isolates of Group A Streptococcus are historically classified according to the M surface 

protein described aboye. The M protein is surface exposed trypsin-sensitive protein generally 
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c< fM|fi^ ll^i;W^^ xed in an al P ha helical formation. The carboxyl terminus 

is anchored in the cytoplasmic membrane and is highly conserved among all group A streptococci. 
The amino terminus, which extend through the cell wall to the cell surface, is responsible for the 
antigenic variability observed among the 80 or more serotypes of M proteins. 

A second layer of classification is based on a variable, trypsin-resistant surface antigen, 
commonly referred to as the T-antigen. Decades of epidemiology based on M and T serological 
typing have been central to studies on the biological diversity and disease causing potential of Group 
A Streptococci. While the M-protein component and its inherent variability have been extensively 
characterized, even after five decades of study, there is still very little known about the structure and 
variability of T-antigens. Antisera to define T types is commercially available from several sources, 
including Sevapharma (http://www.sevapharma.cz/en). 

The gene coding for one form of T-antigen, T-type 6, from an M6 strain of GAS (D741) has 
been cloned and characterized and maps to an approximately 1 1 kb highly variable pathogenicity 
island. Schneewind et al., J Bacteriol. (1990) 172(6):33 10 - 33 17. This island is known as the 
Fibronectin-binding, Collagen-binding T-antigen (FCT) region because it contains, in addition to the 
T6 coding gene (tee6), members of a family of genes coding for Extra Cellular Matrix (ECM) binding 
proteins. Bessen et al., Infection & Immunity (2002) 70(3): 1 159-1 1 67. Several of the protein 
products of this gene family have been shown to directly bind either fibronectin and/or collagen. See 
Hanski et al., Infection & Immunity (1992) 60(12):5 1 19-5 125; Talay et al, Infection & Immunity 
(1992( 60(9):3837-3844; Jaffe et al. (1996) 2 1(2): 3 73 -3 84; Rocha et al., Adv Exp Med Biol. (1997) 
418:737-739; Kreikemeyer et al., J Biol Chem (2004) 279(16):15850-15859; Podbielski et al., Mol. 
Microbiol. (1999) 31(4): 1051-64; and Kreikemeyer et al., Int. J. Med Microbiol (2004) 294(2-3):177- 
88. In some cases direct evidence for a role of these proteins in adhesion and invasion has been 
obtained. 

Applicants raised antiserum against a recombinant product of the teed gene and used it to 
explore the expression of T6 in M6 strain 2724. In immunoblot of mutanolysin extracts of this strain, 
the antiserum recognized, in addition to a band corresponding to the predicted molecular mass of the 
product, very high molecular weight ladders ranging in mobility from about 100 kDa to beyond the 
resolution of the 3-8% gradient gels used. 

This pattern of high molecular weight products is similar to that observed in immunoblots of 
the protein components of the pili identified in Streptococcus agalactiae (described above) and 
previously in Corynebacterium diphtherial Electron microscropy of strain M6_2724 with antisera 
specific for the product of tee6 revealed abundant surface staining and long pilus like structures 
extending up to 700 nanometers from the bacterial surface, revealing that the T6 protein, one of the 
antigens recognized in the original Lancefiled serotyping system, is located within a GAS Adhesin 
Island (GAS AI-1) and forms long covalently linked pilus structures. 

Applicants have identified at least four different Group A Streptococcus Adhesin Islands. 

While these GAS Al sequences can be identified in numerous M types, Applicants have surprisingly 

-6- 



WO 2006/078318 PCT/US2005/027239 

dijpjre^ pilus subunits from the four different GAS AI types 

and specific T classifications. While other trypsin-resistant surface exposed proteins are likely also 
implicated in the T classification designations, the discovery of the role of the GAS adhesin islands 
(and the associated hyper-oligomeric pilus like structures) in T classification and GAS serotype 
variance has important implications for prevention and treatment of GAS infections. Applicants have 
identified protein components within each of the GAS adhesin islands which are associated with the 
pilus formation. These proteins are believed to be involved in the bacteria's initial adherence 
mechanisms. Immunological recognition of these proteins may allow the host immune response to 
slow or prevent the bacteria's transition into the more pathogenic later stages of infection. 

In addition, Applicants have discovered that the GBS pili structures appear to be implicated in 
the formation of biofilms (populations of bacteria growing on a surface, often enclosed in an 
exopolysaccharide matrix). Biofilms are generally associated with bacterial resistance, as antibiotic 
treatments and host immune response are frequently unable to erradicate all of the bacteria 
components of the biofilm. Direction of a host immune response against surface proteins exposed 
during the first steps of bacterial attachment (i.e., before complete biofilm formation) is preferable. 

The invention therefore provides for improved immunogenic compositions against GAS 
infection which may target GAS bacteria during their initial attachment efforts to the host epithelial 
cells and may provide protection against a wide range of GAS serotypes. The immunogenic 
compositions of the invention include GAS AI surface proteins which may be formulated in an 
oligomeric, or hyperoligomeric (pilus) form. The immunogenic compositions of the invention may 
include one or more GAS AI surface proteins. The invention also includes combinations of GAS AI 
surface proteins. Combinations of GAS AI surface proteins may be selected from the same adhesin 
island or they may be selected from different GAS adhesin islands. 

Amino acid sequence encoded by such GAS Adhesin Islands may be used in immunogenic 
compositions for the treatment or prevention of GAS infection. Preferred immunogenic compositions 
of the invention comprise a GAS AI surface protein which has been formulated or purified in an 
oligomeric (pilus) form. In a preferred embodiment, the oligomeric form is a hyperoligomer. 

GAS Adhesin Islands generally include a series of open reading frames within a GAS genome 
that encode for a collection of surface proteins and sortases. A GAS Adhesin Island may encode for 
an amino acid sequence comprising at least one surface protein. The Adhesin Island, therefore, may 
encode at least one surface protein. Alternatively, a GAS Adhesin Island may encode for at least two 
surface proteins and at least one sortase. Preferably, a GAS Adhesin Island encodes for at least three 
surface proteins and at least two sortases. One or more of the surface proteins may include an 
LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. One or more 
GAS AI surface proteins may participate in the formation of a pilus structure on the surface of the 
Gram positive bacteria. 
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P C t$$Bf3 S^fiPf'^S^Ef ^ ion preferably include a divergently transcribed 
transcriptional regulator. The transcriptional regulator may regulate the expression of the GAS AI 
operon. Examples of transcriptional regulators found in GAS AI sequences include RofA and Nra. 

The GAS AI surface proteins may bind or otherwise adhere to fibrinogen, fibronectin, or 
collagen. One or more of the GAS AI surface proteins may comprise a fimbrial structural subunit. 

One or more of the GAS AI surface proteins may include an LPXTG motif or other sortase 
substrate motif. The LPXTG motif may be followed by a hydrophobic region and a charged C 
terminus, which are thought to retard the protein in the cell membrane to facilitate recognition by the 
membrane-localized sortase. See Barneft, et aL, J. Bacteriology (2004) 186 (17): 5865-5875. 

GAS AI sequences may be generally categorized as Type 1, Type 2, Type 3, or Type 4, 
depending on the number and type of sortase sequences within the island and the percentage identity 
of other proteins (with the exception of RofA and cpa) within the island. Schematics of the GAS 
adhesin islands are set forth in FIGURE 51A and FIGURE 162. "GAS Adhesin Island- 1 or "GAS AL- 
T' comprises a series of approximately five open reading frames encoding for a collection of amino 
acid sequences comprising surface proteins and sortases ("GAS AI-1 proteins"). GAS AI-1 
preferably comprises surface proteins, a srtB sortase and a rofA divergently transcribed transcriptional 
regulator. GAS AM surface proteins may include a fibronectin binding protein, a collagen adhesion 
protein and a fimbrial structural subunit. The fimbrial structural subunit (also known as tee6) is 
thought to form the shaft portion of the pilus like structure, while the collagen adhesion protein (Cpa) 
is thought to act as an accessory protein facilitating the formation of the pilus structure, exposed on 
the surface of the bacterial capsule. 

Specifically, GAS AI-1 includes polynucleotide sequences encoding for two or more of 
M6_Spy0157, M6_Spy0158, M6jSpy0159, M6JSpy0160, M6_Spy0161. The GAS AI-1 may also 
include polynucleotide sequences encoding for any one of CDC SS 410_Jimbrial, ISS3650jfimbrial, 
DSM2071_fimbrial 

A preferred immunogenic composition of the invention comprises a GAS AI-1 surface 
protein which may be formulated or purified in an oligomeric (pilus) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. The immunogenic composition of the 
invention may alternatively comprise an isolated GAS AI-1 surface protein in oligomeric (pilus) form. 
The oligomer or hyperoligomeric pilus structures comprising GAS AI-1 surface proteins may be 
purified or otherwise formulated for use in immunogenic compositions. 

One or more of the GAS AI-1 polynucleotide sequences may be replaced by a polynucleotide 
sequence coding for a fragment of the replaced ORF. Alternatively, one or more of the GAS AI-1 
open reading frames may be replaced by a sequence having sequence homology (sequence identity) to 
the replaced ORF. 

One or more of the GAS AI-1 surface proteins typically include an LPXTG motif (such as 

LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are thought to be 

involved in the secretion and anchoring of the LPXTG containing surface proteins. GAS AI-1 may 
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epo^fjpr ; at jlg^stj; flljeji^fa^B; |5 t ro^eiiri;::j; J&Jternati vely , GAS AM may encode for at least two surface 

proteins and at least one sortase. Preferably, GAS AM encodes for at least three surface proteins and 
at least two sortases. One or more of the surface proteins may include an LPXTG motif. 

GAS AM preferably includes a srtB sortase. GAS srtB sortases may preferably anchor 
5 surface proteins with an LPSTG motif (SEQ ID NO: 166), particularly where the motif is followed by 
a serine. 

The GAS AM protein of the composition may be selected from the group consisting of 
M6_Spy0157, M6_Spy0158, M6_Spy0159, M6_Spy0160 M6jSpy0161, CDC SS 410_fimbrial, 
ISS3650_fimbrial, and DSM2071_fimbrial. GAS AM surface proteins M6_Spy0157 (a fibronectin 

10 binding protein), M6__Spy0159 (a collagen adhesion protein, Cpa), M6jSpy0160 (a fimbrial structural 
subunit, tee6), CDC SS 410_fimbrial (a fimbrial structural subunit), ISS3650_fimbrial (a fimbrial 
structural subunit), and DSM207 1 Jftmbrial (a fimbrial structural subunit) are preferred GAS AM 
proteins for use in the immunogenic compositions of the invention. The fimbrial structural subunit 
tee6 and the collagen adhesion protein Cpa are preferred GAS AI —1 surface proteins. Preferably, 

15 each of these GAS AI-1 surface proteins includes an LPXTG sortase substrate motif, such as LPXTG 
(SEQ ID NO: 122) or LPXSG (SEQ ID NO: 134) (conservative replacement of threonine with 
serine). 

In addition to the open reading frames encoding the GAS AI-1 proteins, GAS AI-1 may also 
include a divergently transcribed transcriptional regulator such as rofA (i.e., the transcriptional 

20 regulator is located near or adjacent to the GAS AI protein open reading frames, but it transcribed in 
the opposite direction). 

The GAS AI-1 surface proteins may be used alone, in combination with other GAS AI-1 
surface proteins or in combination with other GAS AI surface proteins. Preferably, the immunogenic 
compositions of the invention include the GAS AI-1 fimbrial structural subunit (tee6) and the GAS 

25 AM collagen binding protein. Still more preferably, the immunogenic compositions of the invention 
include the GAS AI-1 fimbrial structural subunit (tee6). 

A second GAS adhesion island, "GAS Adhesin Island-2" or "GAS AI-2," has also been 
identified in GAS serotypes. Amino acid sequences encoded by the open reading frames of GAS AI- 
2 may also be used in immunogenic compositions for the treatment or prevention of GAS infection. 

30 A preferred immunogenic composition of the invention comprises a GAS AI-2 surface 

protein which may be formulated or purified in an oligomeric (pilus) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. A preferred immunogenic composition of the 
invention alternatively comprises an isolated GAS AI-2 surface protein in oligomeric (pilus) form. 
The oligomer or hyperoligomeric pilus structures comprising GAS AI-2 surface proteins may be 

35 purified or otherwise formulated for use in immunogenic compositions. 

GAS AI-2 comprises a series of approximately eight open reading frames encoding for a 
collection of amino acid sequences comprising surface proteins and sortases ("GAS AI-2 proteins"). 



-9- 



WO 2006/078318 PCT/US2005/027239 

GpOSJAI^ pfCS®lSIM&P^iM^4fl^3 gpDteins, a srtB sortase, a srtCl sortase and a rofA divergently 
transcribed transcriptional regulator. 

Specifically, GAS AI-2 includes polynucleotide sequences encoding for two or more of 
GAS 15, Spy0127, GAS 16, GAS 17, GAS18, Spy0131, Spy0133, and GAS20. 
5 One or more of the GAS AI-2 polynucleotide sequences may be replaced by a polynucleotide 

sequence coding for a fragment of the replaced ORF. Alternatively, one or more of the GAS AI-2 
open reading frames may be replaced by a sequence having sequence homology (sequence identity) to 
the replaced ORF. 

One or more of the GAS AI-2 surface proteins typically include an LPXTG motif (such as 
10 LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are thought to be 
involved in the secretion and anchoring of the LPXTG containing surface proteins. GAS AI-2 may 
encode for at least one surface protein. Alternatively, GAS AI-2 may encode for at least two surface 
proteins and at least one sortase. Preferably, GAS AI-2 encodes for at least three surface proteins and 
at least two sortases. One or more of the surface proteins may include an LPXTG, motif. 
15 GAS AI-2 preferably includes a srtB sortase and a srtCl sortase. As discussed above, GAS 

srtB sortases may preferably anchor surface proteins with an LPSTG motif (SEQ ID NO: 166), 
particularly where the motif is followed by a serine. GAS srtCl sortase may preferentially anchor 
surface proteins with a V(P/V)PTG (SEQ ID NO: 167) motif. GAS srtCl may be differentially 
regulated by rofA. 

20 The GAS AI-2 protein of the composition may be selected from the group consisting of 

GAS 15, Spy0127, GAS 16, GAS 17, GAS 18, Spy0131, Spy0133, and GAS20. GAS AI-2 surface 
proteins GAS 15 (Cpa), GAS 16 (thought to be a fimbrial protein, Ml_128), GAS 18 (Ml_Spy0130), 
and GAS20 are preferred for use in the immunogenic compositions of the invention. GAS 16 is 
thought to form the shaft portion of the pilus like structure, while GAS 15 (the collagen adhesion 

25 protein Cpa) and GAS 18 are thought to act as accessory proteins facilitating the formation of the 

pilus structure, exposed on the surface of the bacterial capsule. Preferably, each of these GAS AI-2 
surface proteins includes an LPXTG sortase substrate motif, such as LPXTG (SEQ ID NO: 122), 
VVXTG (SEQ ID NO: 135), or EVXTG (SEQ ID NO: 136). 

In addition to the open reading frames encoding the GAS AI-2 proteins, GAS AI-2 may also 

30 include a divergently transcribed transcriptional regulator such as rofA {i.e., the transcriptional 

regulator is located near or adjacent to the GAS AI protein open reading frames, but it transcribed in 
the opposite direction). The GAS AI-2 surface proteins may be used alone, in combination with other 
GAS AI-2 surface proteins or in combination with other GAS AI surface proteins. Preferably, the 
immunogenic compositions of the invention include the GAS AI-2 fimbrial protein (GAS 16), the 

35 GAS AI-2 collagen binding protein (GAS 15) and GAS 18 (Ml_Spy0130). More preferably, the 

immunogenic compositions of the invention include the GAS AI-2 fimbrial protein (GAS 16). 

A third GAS adhesion island, "GAS Adhesin Island-3" or "GAS AI-3," has also been 

identified in numerous GAS serotypes. Amino acid sequences encoded by the open reading frames of 
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0|&SJ};A.I 1 }p ^4yi4lgMl!S^!:MiS©d J^rt; pii§^ig^|kic compositions for the treatment or prevention of GAS 

infection. 

A preferred immunogenic composition of the invention comprises a GAS AI-3 surface 
protein which may be formulated or purified in an oligomeric (pilus) form. In a preferred 
5 embodiment, the oligomeric form is a hyperoligomer. A preferred immunogenic composition of the 
invention alternatively comprises an isolated GAS AI-3 surface protein in oligomeric (pilus) form. 
The oligomer or hyperoligomeric pilus structures comprising GAS AI-3 surface proteins may be 
purified or otherwise formulated for use in immunogenic compositions. GAS AI-3 comprises a series 
of approximately seven open reading frames encoding for a collection of amino acid sequences 

10 comprising surface proteins and sortases ("GAS AI-3 proteins"). GAS AI-3 preferably comprises 
surface proteins, a srtC2 sortase, and a Negative transcriptional regulator (Nra) divergently 
transcribed transcriptional regulator. GAS AI-3 surface proteins may include a collagen binding 
protein, a fimbrial protein, and a F2 like fibronectin-binding protein. GAS AI-3 surface proteins may 
also include a hypothetical surface protein. The fimbrial protein is thought to form the shaft portion 

15 of the pilus like structure, while the collagen adhesion protein (Cpa) and the hypothetical surface 
protein are thought to act as accessory proteins facilitating the formation of the pilus structure, 
exposed on the surface of the bacterial capsule. Preferred AI-3 surface proteins include the fimbrial 
proein, the collagen binding protein and the hypothetical protein. Preferably, each of these GAS AI-3 
surface proteins include an LPXTG sortase substrate motif, such as LPXTG (SEQ ID NO: 122), 

20 VPXTG (SEQ ID NO: 137), QVXTG (SEQ ID NO: 138) or LPXAG (SEQ ID NO: 139). 

Specifically, GAS AI-3 includes polynucleotide sequences encoding for two or more of 
SpyM3_0098, SpyM3J)099, SpyM3_0100, SpyM3_0101, SpyM3_0102, SpyM3_0103, 
SpyM3_0104, SpsOlOO, SpsOlOl, Sps0102, Sps0103, Sps0104, SpsOlOS, Sps0106, orf78, orf79, 
orf80, orf81, orf82, orf83, orf84, spyM18__0126, spyM18_0127, spyM18_0128, spyM18_0129, 

25 spyM18J>130, spyM18_0131, spyM18J)132, SpyoM01000156, SpyoMO 1000 155, SpyoMO 1000 154, 
SpyoMO 1 000 1 53 , SpyoMO 1 000 1 52, SpyoMO 1000151, SpyoMO 100015 0, SpyoMO 1 000 1 49, 
ISS3040_fimbrial, ISS3776_fimbrial, and ISS4959_fimbrial. In one embodiment, GAS AI-3 may 
include open reading frames encoding for two or more of SpyM3_0098, SpyM3_0099, SpyM30100, 
SpyM3__0101, SpyM3_0102, SpyM3__0103, and SpyM3J)104. Alternatively, GAS AI-3 may include 

30 open reading frames encoding for two or more of SpsOlOO, SpsOlOl, Sps0102, Sps0103, Sps0104, 
Sps0105, and Sps0106. Alternatively, GAS AI-3 may include open reading frames encoding for two 
or more of orf78, orf79, orfSO, orf81, orf82, orf83, and orf84. Alternatively, GAS AI-3 may include 
open reading frames encoding for two or more of spyM18 0126, spyM18_0127, spyM18_0128, 
spyM18_0129, spyM18J>130, spyM18_0131, and spyM18J)132. Alternatively, GAS AI-3 may 

35 include open reading frames encoding for two or more of SpyoMO 1000 156, SpyoMO 1000 155, 

SpyoMO 1000 154, SpyoMO 1000 153, SpyoMO 1000 152, SpyoMO 10001 51, SpyoMO 1000 150, and 

SpyoMO 1000 149. Alternatively, GAS AI-1 may also include polynucleotide sequences encoding for 

any one of ISS3040jSmbrial, ISS3776_fimbrial, and ISS4959_fimbrial. 
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}p It;;;;; 'pnd' ^l2^i&iliB;iM!S;it^l^' ^a^S^^S^^ir^^Xft^n^cleoticie sequences may be replaced by a polynucleotide 
sequence coding for a fragment of the replaced ORF. Alternatively, one or more of the GAS AI-3 
open reading frames may be replaced by a sequence having sequence homology (sequence identity) to 
the replaced ORF. 

One or more of the GAS AI-3 surface proteins typically include an LPXTG motif (such as 
LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are thought to be 
involved in the secretion and anchoring of the LPXTG containing surface proteins. GAS AI-3 may 
encode for at least one surface protein. Alternatively, GAS AI-3 may encode for at least two surface 
proteins and at least one sortase. Preferably, GAS AI-3 encodes for at least three surface proteins and 
at least two sortases. One or more of the surface proteins may include an LPXTG motif. 

GAS AI-3 preferably includes a srtC2 type sortase. GAS srtC2 type sortases may preferably 
anchor surface proteins with a QVPTG (SEQ ID NO: 140) motif, particularly when the motif is 
followed by a hydrophobic region and a charged C terminus tail. GAS SrtC2 may be differentially 
regulated by Nra. 

The GAS AI-3 protein of the composition may be selected from the group consisting of 
SpyM3_0098, SpyM3_0099, SpyM3_0100, SpyM3_0101, SpyM3 0102, SpyM3_0103, 
SpyM3_0104, SpsOlOO, SpsOlOl, Sps0102, Sps0103, Sps0104, Sps0105, Sps0106, orf78, orf79, 
orf80, orf81, orf82, orf83, orf84, spyM 18J) 126, spyM 18J) 127, spyM18_0128, spyM18_0129, 
spyM 1 8_0 1 3 0, spyM 1 8_0 1 3 1 7 spyM18_0132, SpyoM01000156, SpyoMO 1000 155, SpyoMO 1000 154, 
SpyoMO 1000 153, SpyoMO 1000 152, SpyoM01000151, SpyoMO 1000 150, SpyoMO 1000 149, 
ISS3040_fimbrial, ISS3776j&mbrial, and ISS4959jfimbrial. GAS AI-3 surface proteins 
SpyM3 _0098, SpyM3_0100, SpyM3__0102, SpyM3_0104, SPsOlOO, SPs0102, SPs0104, SPs0106, 
orf78, orf80, orf82, orf84, spyM18J)126, spyM18_0128, spyM18_0130, spyM18_0132, 
SpyoM01000155, SpyoMO 1000 153, SpyoMO 1000 151, SpyoMO 1000 149, ISS3040_fimbrial, 
ISS3776_fimbrial, and ISS4959_fimbrial are preferred GAS AI-3 proteins for use in the immunogenic 
compositions of the invention. 

In addition to the open reading frames encoding the GAS AI-3 proteins, GAS AI-3 may also 
include a transcriptional regulator such as Nra. 

GAS AI-3 may also include a LepA putative signal peptidase I protein. 
The GAS AI-3 surface proteins may be used alone, in combination with other GAS AI-3 
surface proteins or in combination with other GAS AI surface proteins. Preferably, the immunogenic 
compositions of the invention include the GAS AI-3 fimbrial protein, the GAS AI-3 collagen binding 
protein, the GAS AI-3 surface protein (such as SpyM3_0102, M3_Sps0104, M5__prf82, or 
spyM18_O130), and Fibronectin binding protein PrtF2. More preferably, the immunogenic 
compositions of the invention include the GAS AI-3 fimbrial protein, the GAS AI-3 collagen binding 
protein, and the GAS AI-3 surface protein. Still more preferably, the immunogenic compositions of 
the invention include the GAS AI-3 fimbrial protein. 
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p C ''F^ybSMIMi^^IgB'atlB ^AS AI-3 fimbrial protein include SpyM3J)100, 
M3__Sps0102, M5_orf80, spyM18_128, SpyoMO 1000 153, ISS3040__fimbrial, ISS3776Jimbrial, 
ISS4959j&ntarial. 

Representative examples of the GAS AI-3 collagen binding protein include SpyM3 J)098, 
M3_Sps0100, M5_orf 78, spyM18_0126, and SpyoMO 1000 155. 

Representative examples of the GAS AI-3 flbronectin binding protein PrtF2 include 
SpyM3_0104, M3JSps0106, M5_prf84 and spyM18J)132, and SpyoMO 1000 149. 

A fourth GAS adhesion island, "GAS Adhesin Island-4" or "GAS AI-4," has also been 
identified in GAS serotypes. Amino acid sequences encoded by the open reading frames of GAS AI- 
4 may also be used in immunogenic compositions for the treatment or prevention of GAS infection. 

A preferred immunogenic composition of the invention comprises a GAS AI-4 surface 
protein which may be formulated or purified in an oligomeric (pilus) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. A preferred immunogenic composition of the 
invention alternatively comprises an isolated GAS AI-4 surface protein in oligomeric (pilus) form. 
The oligomer or hyperoligomeric pilus structures comprising GAS AI-3 surface proteins may be 
purified or otherwise formulated for use in immunogenic compositions. The oligomeric or 
hyperoligomeric pilus structures comprising GAS AI-4 surface proteins may be purified or otherwise 
formulated for use in immunogenic compositions. 

GAS AI-4 comprises a series of approximately eight open reading frames encoding for a 
collection of amino acid sequences comprising surface proteins and sortases ("GAS AI-4 proteins"). 
This GAS adhesin island 4 ("GAS AI-4") comprises surface proteins, a srtC2 sortase, and a RofA 
regulatory protein. GAS AI-4 surface proteins within may include a fimbrial protein, Fl and F2 like 
fibronectin-binding proteins, and a capsular polysaccharide adhesion protein (cpa). GAS AI-4 surface 
proteins may also include a hypothetical surface protein in an open reading frame (orf). 

The fimbral protein (EftLSL) is thought to form the shaft portion of the pilus like structure, 
while the collagen adhesion protein (Cpa) and the hypothetical protein are thought to act as accessory 
proteins facilitating the formation of the pilus structure, exposed on the surface of the bacterial 
capsule. Preferably, each of these GAS AI-4 surface proteins include an LPXTG sortase substrate 
motif, such as LPXTG (SEQ ID NO: 122), VPXTG (SEQ ID NO: 137), QVXTG (SEQ ID NO: 138) 
or LPXAG (SEQ ID NO: 139). 

Specifically, GAS AI-4 includes polynucleotide sequences encoding for two or more of 
19224134, 19224135, 19224136, 19224137, 19224138, 19224139, 19224140, and 19224141. A GAS 
AI-4 polynucleotide may also include polynucleotide sequences encoding for any one of 
20010296 Jimbrial, 20020069 Jimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, ISS4538_fimbrial. 
One or more of the GAS AI-4 polynucleotide sequences may be replaced by a polynucleotide 
sequence coding for a fragment of the replaced ORF. Alternatively, one or more of the GAS AI-4 
open reading frames may be replaced by a sequence having sequence homology (sequence identity) to 
the replaced ORF. 
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p C ' W %Wip€llci!Eit3^' ^yfiHFi: Wace proteins typically include an LPXTG motif (such as 
LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are thought to be 
involved in the secretion and anchoring of the LPXTG containing surface proteins. GAS AI-4 may 
encode for at least one surface protein. Alternatively, GAS AI-4 may encode for at least two surface 
proteins and at least one sortase. Preferably, GAS AI-4 encodes for at least three surface proteins and 
at least two sortases. One or more of the surface proteins may include an LPXTG motif. 

GAS AI-4 includes a SrtC2 type sortase. GAS SrtC2 type sortases may preferably anchor 
surface proteins with a QVPTG (SEQ ID NO: 140) motif, particularly when the motif is followed by a 
hydrophobic region and a charged C terminus tail. 

The GAS AI-4 protein of the composition may be selected from the group consisting of 
19224134, 19224135, 19224136, 19224137, 19224138, 19224139, 19224140, 19224141, 
20010296jfimbrial, 20020069 Jfimbrial, CDC SS 635J5mbrial, ISS4883JSmbrial, and 
ISS4538_fimbrial. GAS AI-4 surface proteins 19224134, 19224135, 19224137, 19224139, 
19224141, 2001 0296 Jimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883__fimbrial, 
IS S45 3 8 Jimbrial are preferred proteins for use in the immunogenic compositions of the invention. 

In addition to the open reading frames encoding the GAS AI-4 proteins, GAS AI-4 may also 
include a divergently transcribed transcriptional regulator such as RofA (i.e., the transcriptional 
regulator is located near or adjacent to the AI protein open reading frames, but it transcribed in the 
opposite direction. 

GAS AI-4 may also include a LepA putative signal peptidase I protein and a MsmRL protein. 
The GAS AI-4 surface proteins may be used alone, in combination with other GAS AI-4 surface 
proteins or in combination with other GAS AI surface proteins. Preferably, the immunogenic 
compositions of the invention include the GAS AI-4 fimbrial protein (EftLSL or 2001 029 6_fimbrial, 
20020069_fimbrial, CDC SS 635J5mbrial, ISS48 83 Jfimbrial, or ISS4538_fimbrial), the GAS AI-4 
collagen binding protein, the GAS AI-4 surface protein (such as M12 isolate A735 orf 2), and 
fibronectin binding protein PrtFl and PrtF2. More preferably, the immunogenic compositions of the 
invention include the GAS AI-4 fimbrial protein, the GAS AI-4 collagen binding protein, and the 
GAS AI-4 surface protein. Still more preferably, the immunogenic compositions of the invention 
include the GAS AI-4 fimbrial protein. 

The GAS AI proteins of the invention maybe used in immunogenic compositions for 

prophylactic or therapeutic immunization against GAS infection. For example, the invention may 

include an immunogenic composition comprising one or more GAS Al-1 proteins and one or more of 

any of GAS AI-2, GAS AI-3, or GAS AI-4 proteins. For example, the invention includes an 

immunogenic composition comprising at least two GAS AI proteins where each protein is selected 

from a different GAS adhesin island. The two GAS AI proteins may be selected from one of the 

following GAS AI combinations: GAS AM and GAS AI-2; GAS AI-1 and GAS AI-3; GAS AI-1 

and GAS AI-4; GAS AI-2 and GAS AI-3; GAS AI-2 and GAS AI-4; and GAS AI 3 and GAS AI-4. 

Preferably the combination includes fimbrial proteins from one or more GAS adhesin islands. 
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p CI THttf foP^^ also be selected to provide protection against an 

increased range of GAS serotypes and strain isolates. For example, the immunogenic composition 
may comprise a first and second GAS AI protein, wherein a full length polynucleotide sequence 
encoding for the first GAS AI protein is not present in a genome comprising a full length 
5 polynucleotide sequence encoding for the second GAS AI protein. In addition, each antigen selected 
for use in the immunogenic compositions will preferably be present in the genomes of multiple GAS 
serotypes and strain isolates. Preferably, each antigen is present in the genomes of at least two (i.e., 3, 

4, 5, 6, 7, 8, 9, 10, or more) GAS strain isolates. More preferably, each antigen is present in the 
genomes of at least two (z.e., at least 3, 4, 5, or more) GAS serotypes. 

1 0 Applicants have also identified adhesin islands within the genome of Streptococcus 

pneumoniae. These adhesion islands are thought to encode surface proteins which are important in 
the bacteria's virulence. Amino acid sequence encoded by such S. pneumoniae Adhesin Islands may 
be used in immunogenic compositions for the treatment or prevention of S. pneumoniae infection. 
Preferred immunogenic compositions of the invention comprise a S. pneumoniae AI surface protein 

1 5 which has been formulated or purified in an oligomeric (pilus) form. In a preferred embodiment, the 
oligomeric form is a hyperoligomer. A preferred immunogenic composition of the invention 
alternatively comprises an isolated S. pneumoniae surface protein in oligomeric (pilus) form. The 
oligomer or hyperoligomeric pilus structures comprising S. pneumoniae surface proteins, may be 
purified or otherwise formulated for use in immunogenic compositions. 

20 The S. pneumoniae Adhesin Islands generally include a series of open reading frames within a 

5. pneumoniae genome that encode for a collection of surface proteins and sortases. A S. pneumoniae 
Adhesin Island may encode for an amino acid sequence comprising at least one surface protein. 
Alternatively, the S. pneumoniae Adhesin Island may encode for at least two surface proteins and at 
least one sortase. Preferably, a S. pneumoniae Adhesin Island encodes for at least three surface 

25 proteins and at least two sortases. One or more of the surface proteins may include an LPTXG motif 
(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. One or more S. pneumoniae AI 
surface proteins may participate in the formation of a pilus structure on the surface of the S. 
pneumoniae bacteria. 

The S. pneumoniae Adhesin Islands of the invention preferably include a divergently 
30 transcribed transcriptional regulator. The transcriptional regulator may regulate the expression of the 
S. pneumonaie AI operon. An example of a transcriptional regulator found in S. pneumoniae AI 
sequences is rlrA. 

A schematic of the organization of a S. pneumoniae AI locus is provided in Figure 137. The 
locus comprises open reading frames encoding a transcriptional regulator (rlrA), cell wall surface 
35 proteins (rrgA, rrgB, rrgC) and sortases (srt B, srtC, srtD). 

S. pneumoniae AI sequences may be generally divided into two groups of homology, S. 
pneuamoniae Al-a and Al-b. S. pneumoniae strains that comprise Al-a include 14 CSR 10, 19A 

-15- 



WO 2006/078318 PCT/US2005/027239 

I^pi|ar5[j'6 ? .S^ pjijjq|l^d:;15'; jpffCJ^pjFiptepd 12, and 6B Spain 2. S. pneumoniae AI strains that 
comprise Al-b include 19F Taiwan 14, 9V Spain 3, 23F Taiwan 15 and TIGR4. 

S. pneumoniae AI from TIGR4 comprises a series of approximately seven open reading 
frames encoding for a collection of amino acid sequences comprising surface proteins and sortases 
5 ("& pneumoniae AI proteins"). Specifically, S. pneumoniae AI from TIGR4 includes polynucleotide 
sequences encoding for two or more of SP0462, SP0463, SP0464, SP0465, SP0466, SP0467, and 
SP0468. 

One or more of the S. pneumoniae AI from TIGR4 polynucleotide sequences may be replaced 
by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one or more 
10 of the S. pneumoniae AI from TIGR4 open reading frames may be replaced by a sequence having 
sequence homology to the replaced ORF. 

S. pneumoniae strain 670 AI comprises a series of approximately seven open reading frames 
encoding for a collection of amino acid sequences comprising surface proteins and sortases ("& 
pneumoniae AI proteins"). Specifically, S. pneumoniae strain 670 AI includes polynucleotide 
15 sequences encoding for two or more of orfl_670, orf3 J570, orf4__670, orf5_670, orf6_670, orf7_670, 
and orf8_670. 

One or more of the S. pneumoniae strain 670 AI polynucleotide sequences may be replaced 
by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one or more 
of the S. pneumoniae strain 670 AI open reading frames may be replaced by a sequence having 

20 sequence homology to the replaced ORF. 

S. pneumoniae AI from 14 CSR10 comprises a series of approximately seven open reading 
frames encoding for a collection of amino acid sequences comprising surface proteins and sortases 
("$. pneumoniae AI proteins"). Specifically, S. pneumoniae AI from 14 CSR10 includes 
polynucleotide sequences encoding for two or more of ORF2_14CSR, ORF3_14CSR, ORF4_14CSR, 

25 ORF5_14CSR, ORF6_14CSR, ORF7_14CSR, and ORF8_14CSR. 

One or more of the S. pneumoniae AI from 14 CSR10 polynucleotide sequences may be 
replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 
or more of the S, pneumoniae AI from 14 CSR10 open reading frames may be replaced by a sequence 
having sequence homology to the replaced ORF. 

30 S. pneumoniae AI from 19A Hungary 6 comprises a series of approximately seven open 

reading frames encoding for a collection of amino acid sequences comprising surface proteins and 
sortases ("£. pneumoniae AI proteins"). Specifically, S. pneumoniae AI from 19A Hungary 6 
includes polynucleotide sequences encoding for two or more of ORF219AH, ORF3_19AH, 
ORF4_19AH, ORF5_19AH, ORF6_19AH, ORF7J9AH, and ORF8_19AH. 

35 One or more of the S. pneumoniae AI from 19A Hungary 6 polynucleotide sequences may be 

replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

or more of the S. pneumoniae AI from 19A Hungary 6 open reading frames may be replaced by a 

sequence having sequence homology to the replaced ORF. 
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p \C ^ pnUMM^M^mMMEW^hn 14 comprises a series of approximately seven open 

reading frames encoding for a collection of amino acid sequences comprising surface proteins and 

sortases pneumoniae AI proteins")- Specifically, & pneumoniae AI from 19F Taiwan 14 includes 

polynucleotide sequences encoding for two or more of ORF2_19FTW, ORF3_19FTW, 

5 ORF419FTW, ORF519FTW, ORF6„19FTW, ORF7_19FTW, and ORF8J9FTW. 

One or more of the S. pneumoniae AI from 19F Taiwan 14 polynucleotide sequences may be 

replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

or more of the S. pneumoniae AI from 19F Taiwan 14 open reading frames may be replaced by a 

sequence having sequence homology to the replaced ORF. 

10 S. pneumoniae AI from 23F Poland 16 comprises a series of approximately seven open 

reading frames encoding for a collection of amino acid sequences comprising surface proteins and 

sortases ("£. pneumoniae AI proteins"). Specifically, S. pneumoniae AI from 23F Poland 1 6 includes 

polynucleotide sequences encoding for two or more of ORF2_23FP, ORF3_23FP, ORF4_23FP, 

ORF5_23FP, ORF6_23FP, ORF7_23FP, and ORF8_23FP. 

15 One or more of the S. pneumoniae AI from 23F Poland 16 polynucleotide sequences may be 

replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

or more of the S. pneumoniae AI from 23F Poland 16 open reading frames may be replaced by a 

sequence having sequence homology to the replaced ORF. 

S. pneumoniae AI from 23F Taiwan 15 comprises a series of approximately seven open 

20 reading frames encoding for a collection of amino acid sequences comprising surface proteins and 

sortases ("& pneumoniae AI proteins"). Specifically, S. pneumoniae AI from 23F Taiwan 15 includes 

polynucleotide sequences encoding for two or more of ORF2_23FTW, ORF3_23FTW, 

ORF4_23FTW, ORF5_23FTW, ORF6_23FTW, ORF7_23FTW, and ORF8__23FTW. 

One or more of the S. pneumoniae AI from 23F Taiwan 15 polynucleotide sequences may be 

25 replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

or more of the S. pneumoniae AI from 23F Taiwan 15 open reading frames may be replaced by a 

sequence having sequence homology to the replaced ORF. 

& pneumoniae AI from 6B Finland 12 comprises a series of approximately seven open 

reading frames encoding for a collection of amino acid sequences comprising surface proteins and 

30 sortases pneumoniae AI proteins"). Specifically, S. pneumoniae AI from 6B Finland 12 includes 

polynucleotide sequences encoding for two or more of ORF26BF, ORF3_6BF, ORF4 6BF, 

ORF5_6BF, ORF6_6BF, ORF7__6BF, and ORF8_6BF. 

One or more of the S. pneumoniae AI from 6B Finland 12 polynucleotide sequences may be 

replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

35 or more of the S. pneumoniae AI from 6B Finland 12 open reading frames may be replaced by a 

sequence having sequence homology to the replaced ORF. 

S. pneumoniae AI from 6B Spain 2 comprises a series of approximately seven open reading 

frames encoding for a collection of amino acid sequences comprising surface proteins and sortases 
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('!$■ ®^w^y$^ S. pneumoniae AI from 6B Spain 2 includes 

polynucleotide sequences encoding for two or more of ORF2_6BSP, ORF3_6BSP, ORF4 6BSP, 
ORF5J5BSP, ORF6_6BSP, ORF7J5BSP, and ORF8_6BSP. 

One or more of the S. pneumoniae AI from 6B Spain 2 polynucleotide sequences may be 
5 replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 
or more of the & pneumoniae AI from 6B Spain 2 open reading frames may be replaced by a 
sequence having sequence homology to the replaced ORF. 

S. pneumoniae AI from 9V Spain 3 comprises a series of approximately seven open reading 
frames encoding for a collection of amino acid sequences comprising surface proteins and sortases 

10 ("£. pneumoniae AI proteins")- Specifically, S. pneumoniae AI from 9V Spain 3 includes 

polynucleotide sequences encoding for two or more of ORF2_9VSP, ORF3_9VSP, ORF4_9VSP, 
ORF5_9VSP, ORF6_9VSP, ORF7_9VSP, and ORF8_9VSP. 

One or more of the S. pneumoniae AI from 9V Spain 3 polynucleotide sequences may be 
replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one 

15 or more of the S. pneumoniae AI from 9V Spain 3 open reading frames may be replaced by a 
sequence having sequence homology to the replaced ORF. 

One or more of the S, pneumoniae AI surface proteins typically include an LPXTG motif 
(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. These sortase proteins are 
thought to be involved in the secretion and anchoring of the LPXTG containing surface proteins. S. 

20 pneumoniae AI may encode for at least one surface protein. The Adhesin Island, may encode at least 
one surface protein. Alternatively, S. pneumoniae AI may encode for at least two surface proteins and 
at least one sortase. Preferably, S. pneumoniae AI encodes for at least three surface proteins and at 
least two sortases. One or more of the surface proteins may include an LPXTG motif. 

The S. pneumoniae AI protein of the composition may be selected from the group consisting 

25 of SP0462, SP0463, SP0464, SP0465, SP0466, SP0467, SP0468, orfl_670, orf3_670, orf4___670, 
orf5j570, orf6_670, orf7__670, orf8_670, ORF2_14CSR, ORF3J4CSR, ORF4_14CSR, 
ORF5J4CSR, ORF6_14CSR, ORF7_14CSR, ORF8_14CSR, ORF2_19AH, ORF3_19AH, 
ORF4 19AH, ORF5_19AH, ORF6_19AH, ORF 7_ 19 AH, ORF8_19AH, ORF2__19FTW, 
ORF3_19FTW, ORF4_19FTW, ORF5_19FTW, ORF6_19FTW, ORF7_19FTW, ORF8_19FTW, 

30 ORF2 23FP, ORF3 23FP, ORF4 23FP, ORF5 23FP, ORF6_23FP, ORF7_23FP, ORF8_23FP, 
ORF2_23FTW, ORF3 23FTW, ORF4 23FTW, ORF5_23FTW, ORF6_23FTW, ORF7_23FTW, 
ORF8_23FTW, ORF2_6BF 3 ORF3_6BF, ORF4_6BF, ORF5_6BF, ORF6_6BF, ORF7_6BF, 
ORF8_6BF, ORF2_6BSP, ORF3_6BSP, ORF4_6BSP, ORF5_6BSP, ORF6_6BSP, ORF7_6BSP, 
ORF8_6BSP, ORF2_9VSP, ORF3_9VSP, ORF4_9VSP, ORF5_9VSP, ORF6_9VSP, ORF7_9VSP 

35 and, ORF8_9VSP. 

S. pneumoniae AI surface proteins are preferred proteins for use in the immunogenic 

compositions of the invention. In one embodiment, the compositions of the invention comprise 

combinations of two or more S pneumoniae AI surface proteins. Preferably such combinations are 
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sg&<|jf^ of SP0462, SP0463, SP0464, orf3_670, orf4_670, 

orf5_670, ORF3_14CSR, ORF4_14CSR, ORF5_14CSR, ORF3_19AH, ORF4 19AH, ORF5_19AH, 
ORF3_19FTW, ORF4__19FTW, ORF5__19FTW, ORF3_23FP, ORF4_23FP, ORF5_23FP, 
ORF3_23FTW, ORF4_23FTW, ORF5J23FTW, ORF3_6BF, ORF4J5BF, ORF5J5BF, ORF3J5BSP, 
ORF4_6BSP, ORF5_6BSP, ORF3__9VSP, ORF4_9VSP, and ORF5_9VSP. 

In addition to the open reading frames encoding the S. pneumoniae AI proteins, & 
pneumoniae AI may also include a transcriptional regulator. 

The S. pneumoniae AI proteins of the invention may be used in immunogenic compositions 
for prophylactic or therapeutic immunization against S. pneumoniae infection. For example, the 
invention may include an immunogenic composition comprising one or more S. pneumoniae from 
TIGR4 AI proteins and one or more S. pneumoniae strain 670 proteins. The immunogenic 
composition may comprise one or more AI proteins from any one or more of S. pneumoniae strains 
TIGR4, 19A Hungary 6, 6B Finland 12, 6B Spain 2, 9V Spain 3, 14 CSR 10, 19F Taiwan 14, 23F 
Taiwan 15, 23F Poland 16, and 670. 

The immunogenic compositions may also be selected to provide protection against an 
increased range of S. pneumoniae serotypes and strain isolates. For example, the immunogenic 
composition may comprise a first and second S. pneumoniae AI protein, wherein a full length 
polynucleotide sequence encoding for the first S. pneumoniae AI protein is not present in a genome 
comprising a full length polynucleotide sequence encoding for the second S. pneumoniae AI protein. 
In addition, each antigen selected for use in the immunogenic compositions will preferably be present 
in the genomes of multiple S. pneumoniae serotypes and strain isolates. Preferably, each antigen is 
present in the genomes of at least two (i.e., 3, 4, 5, 6, 7, 8, 9, 10, or more) S. pneumoniae strain 
isolates. More preferably, each antigen is present in the genomes of at least two {i.e., at least 3, 4, 5, 
or more) S. pneumoniae serotypes. 

The immunogenic compositions may also be selected to provide protection against an 
increased range of serotypes and strain isolates of a Gram positive bacteria. For example, the 
immunogenic composition may comprise a first and second Gram positive bacteria AI protein, 
wherein a full length polynucleotide sequence encoding for the first Gram positive bacteria AI protein 
is not present in a genome comprising a full length polynucleotide sequence encoding for the second 
Gram positive bacteria AI protein. In addition, each antigen selected for use in the immunogenic 
compositions will preferably be present in the genomes of multiple serotypes and strain isolates of the 
Gram positive bacteria. Preferably, each antigen is present in the genomes of at least two (i.e., 3, 4, 5, 
6, 7, 8, 9, 10, or more) Gram positive bacteria strain isolates. More preferably, each antigen is present 
in the genomes of at least two (i.e., at least 3, 4, 5, or more) Gram positive bacteria serotypes.One or 
both of the first and second AI proteins may preferably be in oligomeric or hyperoligomeric form. 

Adhesin island surface proteins from two or more Gram positive bacterial genus or species 
may be combined to provide an immunogenic composition for prophylactic or therapeutic treatment 
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ol^igasp-o.F^fegijfl ^q;;^^^;:|©i;gjp^;^|ositive bacterial genus or species. Optionally, the adhesin 
island surface proteins may be associated together in an oligomeric or hyperoligomeric structure. 

In one embodiment, the invention comprises adhesin island surface proteins from two or more 
Streptococcus species. For example, the invention includes a composition comprising a GBS AI 
5 surface protein and a GAS adhesin island surface protein. As another example, the invention includes 
a composition comprising a GAS adhesin island surface protein and a S. pneumoniae adhesin island 
surface protein. One or both of the GAS AI surface protein and the S. pneumoniae AI surface protein 
may be in oligomeric or hyperoligomeric form. As a further example, the invention includes a 
composition comprising a GBS adhesin island surface protein and a S. pneumoniae adhesin island 
10 surface protein. 

In one embodiment, the invention comprises an adhesin island surface protein from two or 
more Gram positive bacterial genus. For example, the invention includes a composition comprising a 
Streptococcus adhesin island protein and a Cory neb act erium adhesin island protein. One or more of 
the Gram positive bacteria AI surface proteins may be in an oligomeric or hyperoligomeric form. 

15 In addition, the AI polynucleotides and amino acid sequences of the invention may also be 

used in diagnostics to identify the presence or absence of GBS (or a Gram positive bacteria) in a 
biological sample. They may be used to generate antibodies which can be used to identify the 
presence of absence of an AI protein in a biological sample or in a prophylactic or therapeutic 
treatment for GBS (or a Gram positive bacterial) infection. Further, the AI polynucleotides and amino 

20 acid sequences of the invention may also be used to identify small molecule compounds which inhibit 
or decrease the virulence associated activity of the AI. 

BRIEF DESCRIPTION OF THE FIGURES 
FIGURE 1 presents a schematic depiction of Adhesin Island 1 ("AH") comprising open 
reading frames for GBS 80, GBS 52, SAG0647, SAG0648 and GBS 104. 

25 FIGURE 2 illustrates the identification of AM sequences in several GBS serotypes and strain 

isolates (GBS serotype V, strain isolate 2603; GBS serotype III, strain isolate nem3 16; GBS serotype 
II, strain isolate 18RS21; GBS serotype V, strain isolate CJB111; GBS serotype III, strain isolate 
COH1 and GBS serotype la, strain isolate A909). (An AH was not identified in GBS serotype lb, 
strain isolate H36B or GBS serotype la, strain isolate 515). 

30 FIGURE 3 presents a schematic depiction of the correlation between AI-1 and the Adhesin 

Island 2 ("AI-2") within the GBS serotype V, strain isolate 2603 genome. (This AI-2 comprises open 
reading frames for GBS 67, GBS 59, SAG1406, SAG1405 and GBS 150). 

FIGURE 4 illustrates the identification of AI-2 comprising open reading frames encoding for 
GBS 67, GBS 59, SAG1406, SAG1404 and GBS 150 (or sequences having sequence homology 

35 thereto) in several GBS serotypes and strain isolates (GBS serotype V, strain isolate 2603; GBS 

serotype III, strain isolate NEM3 16; GBS serotype lb, strain isolate H36B; GBS serotype V, strain 

isolate CJB1 1 1; GBS serotype II, strain isolate 18RS21; and GBS serotype la, strain isolate 515). 

Figure 4 also illustrates the identification of AI-2 comprising open reading frames encoding for 01520 
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Op^&aS^^WB ffiMK/(a®S^aB®3 (spbl), 01524 and 01525 (or sequences having sequence 
homology thereto). 

FIGURE 5 presents data showing that GBS 80 binds to fibronectin and fibrinogen in ELISA. 
FIGURE 6 illustrates that all genes in AM are co-transcribed as an operon. 
5 ' FIGURE 7 presents schematic depictions of in-frame deletion mutations within AI-1 . 

FIGURE 8 presents FACS data showing that GBS 80 is required for surface localization of 
GBS 104. 

FIGURE 9 presents FACS data showing that sortases SAG0647 and SAG0648 play a semi- 
redundant role in surface exposure of GBS 80 and GBS 104. 
10 FIGURE 10 presents Western Blots of the in-frame deletion mutants probed with anti-GBS80 

and anti-GBS 104 antisera. 

FIGURE 11: Electron micrograph of surface exposed pili structures in Streptococcus 
agalactiae containing GBS 80. 

FIGURE 12: PHD predicted secondary structure of GBS 067. 
15 FIGURE 13, 14 and 15: Electron micrograph of surface exposed pili structures of strain 

isolate COH1 of Streptococcus agalactiae containing a plasmid insert encoding GBS 80. 

FIGURE 16 and 17: Electron micrograph of surface exposed pili structure of wild type strain 
isolate COH1 of Streptococcus agalactiae. 

FIGURE 18: Alignment of polynucleotide sequences of Al-1 from serotype V, strain isolates 
20 2603 and CJB 111; serotype II, strain isolate 1 8RS2 1 ; serotype III, strain isolates COH1 and NEM3 1 6; 
and serotype la, strain isolate A909. 

FIGURE 19: Alignment of polynucleotide sequences of AI-2 from serotype V, strain isolates 
2603 and CJB11 1; serotype II, strain isolate 18RS21; serotype lb, strain isolate H36B; and serotype 
la, strain isolate 515, 

25 FIGURE 20: Alignment of polynucleotide sequences of AI-2 from serotype V, strain isolate 

2603 and serotype III, strain isolate NEM3 16. 

FIGURE 21: Alignment of polynucleotide sequences of AI-2 from serotype III, strain isolate 
COH1 and serotype la, strain isolate A909. 

FIGURE 22: Alignment of amino acid sequences of AI-1 surface protein GBS 80 from 
30 serotype V, strain isolates 2603 and CJB 1 1 1; serotype la, strain isolate A909; serotype III, strain 
isolates COH1 andNEM316. 

FIGURE 23: Alignment of amino acid sequences of AI-1 surface protein GBS 104 from 
serotype V, strain isolates 2603 and CJB 111; serotype III, strain isolates COH1 and NEM3 1 6; and 
serotype II, strain isolate 18RS2L 
35 FIGURE 24: Alignment of amino acid sequences of AI-2 surface protein GBS 067 from 

serotype V, strain isolates 2603 and CJB1 1 1; serotype la, strain isolate 515; serotype II, strain isolate 
18RS21; serotype lb, strain isolate H36B; and serotype III, strain isolate NEM3 1 6. 
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p C TPPIJPSO B^^ffifWffi $P§ ciosel y associates with tight junctions and cross the 
monolayer of ME 180 cervical epithelial cells by a paracellular route. 
FIGURE 26: Illustrates GBS infection of ME180 cells. 

FIGURE 27: Illustrates that GBS 80 recombinant protein does not bind to epithelial cells. 
5 FIGURE 28: Illustrates that deletion of GBS 80 does not effect the capacity of GBS strain 

2603 V/R to adhere and invade ME180 cervical epithelial cells. 

FIGURE 29: Illustrates binding of recombinant GBS 104 protein to epithelial cells. 

FIGURE 30: Illustrates that deletion of GBS 104 in the GBS strain COH1, reduces the 
capacity of GBS to adhere to ME180 cervical epithelial cells. 
1 0 FIGURE 3 1 : Illustrates that GBS 80 knockout mutant strain partially loses the ability to 

translocate through an epithelial cell monolayer. 

FIGURE 32: Illustrates that deletion of GBS 104, but not GBS 80, reduces the capacity of 
GBS to invade J774 macrophage-like cell line, 

FIGURE 33: Illustrates that GBS 104 knockout mutant strain translocates through an 
1 5 epithelial monolayer less efficiently than the isogenic wild type. 

FIGURE 34: Negative stained electron micrographs of GBS serotype III, strain isolate 
COH1, containing a plasmid insert to over-express GBS 80. 

FIGURE 35: Electron micrographs of surface exposed pili structures on GBS serotype III, 
strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 80 
20 antibodies (visualized with 10 nm gold particles). 

FIGURE 36: Electron micrographs of surface exposed pili structures on GBS serotype III, 
strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 80 
antibodies (visualized with 10 nm gold particles). 

FIGURE 37: Electron micrographs of surface exposed pili structures on GBS serotype III, 
25 strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 80 
antibodies (visualized with 20 n m gold particles). 

FIGURE 38: Electron micrographs of surface exposed pili structures on GBS serotype III, 
strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 104 
antibodies or preimmune sera (visualized with 10 nm gold particles). 
30 FIGURE 39: Electron micrographs of surface exposed pili structures on GBS serotype III, 

strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 80 
antibodies (visualized with 20 nm gold particles) and anti-GBS 104 antibodies (visualized with 10 nm 
gold particles). 

FIGURE 40: Electron micrographs of surface exposed pili structures on GBS serotype III, 
35 strain isolate COH1, containing a plasmid insert to over-express GBS 80, stained with anti-GBS 80 

antibodies (visualized with 20 nm gold particles) and anti-GBS 104 antibodies (visualized with 10 nm 
gold particles). 
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Sp C TOFUDSIBO fct'ri^pffl B&08O is necessary for polymer formation and GBS104 and 
sortase SAG0648 are necessary for efficient assembly of pili. 

FIGURE 42: Illustrates that GBS 67 is part of a second pilus and that GBS 80 is polymerized 
in strain 515, 

5 FIGURE 43: Illustrates that two macro-molecules are visible in Cohl, one of which is the 

GBS 80 pilin. 

FIGURE 44: Illustrates pilin assembly. 

FIGURE 45: Illustrates that GBS 52 is a minor component of the GBS pilus. 
FIGURE 46: Illustrates that the pilus is found in the supernatant of a bacterial culture. 
10 FIGURE 47: Illustrates that the pilus is found in the supernatant of bacterial cultures in all 

phases. 

FIGURE 48: Illustrates that in Cohl, only the GBS 80 protein and one sortase (sag0647 or 
sag0648) is required for polymerization. 

FIGURE 49: IEM image of GBS 80 staining of a GBS serotype VIII strain JM9030013 that 
1 5 express pili. 

FIGURE 50: IEM image of GBS 104 staining of a GBS serotype VIII strain JM9030013 that 
express pili. 

FIGURE 51 A: Schematic depiction of open reading frames comprising a GAS AI-2 serotype 
Ml isolate, GAS AI-3 serotype M3, M5, M18, and M49 isolates, a GAS AI-4 serotype M12 isolate, 
20 and an GAS AI-1 serotype M6 isolate. 

FIGURE 51B: Amino acid alignment of SrtCl-type sortase of a GAS AI-2 serotype Ml 
isolate, SrtC2-type sortases of serotype M3, M5, M18, andM49 isolates, and a SrtC2-type sortase of a 
GAS AI-4 serotype M12 isolate. 

FIGURE 52: Amino acid alignment of the capsular polysaccharide adhesion proteins of GAS 
25 AI-4 serotype Ml 2 (A735), GAS AI-3 serotype M5 (Manfredo), S. pyogenes strain MGAS3 15 

serotype M3, S. pyogenes strain SSI-1 serotype M3, S. pyogenes strain MGAS8232 serotype M3, and 
GAS AI-2 serotype ML 

FIGURE 53: Amino acid alignment of F-like fibronectin-binding proteins of GAS AI-4 
serotype M12 (A735) and SL pyogenes strain MGAS 10394 serotype M6. 
30 FIGURE 54: Amino acid alignment of F2-like fibronectin-binding proteins of GAS AI-4 

serotype M12 (A735), S. pyogenes strain MGAS8232 serotype M3, GAS AI-3 strain M5 (Manfredo), 
S. pyogenes strain SSI serotype M3, and S. pyogenes strain MGAS3 15 serotype M3. 

FIGURE 55: Amino acid alignment of fimbrial proteins of GAS AI-4 serotype M12 (A735), 
GAS AI-3 serotype M5 (Manfredo), S. pyogenes strain MGAS315 serotype M3, S. pyogenes strain 
35 SSI serotype M3, S. pyogenes strain MGAS 823 2 serotype M3, and S. pyogenes Ml GAS serotype 
Ml. 
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jp C "1 ^^ffi'lCilS^^l^ iS^SM^t of hypothetical proteins of GAS AI-4 serotype Ml 2 
(A735), S. pyogenes strain MGAS3 15 serotype M3, S. pyogenes strain SSM serotype M3, GAS AI-3 
serotype M5 (Manfredo), and S. pyogenes strain MGAS8232 serotype M3. 

FIGURE 57: Results of FASTA homology search for amino acid sequences that align with 
5 the collagen adhesion protein of GAS AI-1 serotype M6 (MG AS 10394). 

FIGURE 58: Results of FASTA homology search for amino acid sequences that align with 
the fimbrial structural subunit of GAS AI-1 serotype M6 (MGAS 10394). 

FIGURE 59: Results of FASTA homology search for amino acid sequences that align with 
the hypothetical protein of GAS AI-2 serotype Ml (SF370). 
10 FIGURE 60: Specifies pilin and E box motifs present in GAS type 3 and 4 adhesin islands. 

FIGURE 61: Illustrates that surface expression of GBS 80 protein on GBS strains COH and 
JM9130013 correlates with formation of pili structures. Surface expression of GBS 80 was 
determined by FACS analysis using an antibody that cross-hybridizes with GBS 80. Formation of pili 
structures was determined by immunogold electron microscopy using gold-labelled anti-GBS 80 
15 antibody. 

FIGURE 62: Illustrates that surface exposure is capsule-dependent for GBS 322 but not for 
GBS 80. 

FIGURE 63: Illustrates the amino acid sequence identity of GBS 59 proteins in GBS strains. 
FIGURE 64: Western blotting of whole GBS cell extracts with anti-GBS 59 antibodies. 
20 FIGURE 65: Western blotting of purified GBS 59 and whole GBS cell extracts with anti- 

GBS 59 antibodies. 

FIGURE 66: FACS analysis of GBS strains CJB1 1 1, 7357B, 515 using GBS 59 antiserum. 
FIGURE 67: Illustrates that anti-GBS 59 antibodies are opsonic for CJB1 1 1 GBS strain 
serotype V. 

25 FIGURE 68: Western blotting of GBS strain JM9130013 total extracts. 

FIGURE 69: Western blotting of GBS stain 515 total extracts shows that GBS 67 and GBS 
1 50 are parts of a pilus. 

FIGURE 70: Western blotting of GBS strain 515 knocked out for GBS 67 expression 
FIGURE 71: FACS analysis of GBS strain 515 and GBS strain 515 knocked out for GBS 67 
30 expression using GBS 67 and GBS 59 antiserum. 

FIGURE 72: Illustrates complementation of GBS 515 knocked out for GBS 67 expression 
with a construct overexpressing GBS 80. 

FIGURE 73: FACS analysis of GAS serotype M6 for spyM6_0159 surface expression. 
FIGURE 74: FACS analysis of GAS serotype M6 for spyM6_0160 surface expression. 
35 FIGURE 75: FACS analysis of GAS serotype Ml for GAS 15 surface expression. 

FIGURE 76: FACS analysis of GAS serotype Ml for GAS 16 surface expression using a 
first anti-GAS 16 antiserum. 
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p C TKStMl&O EA0Si:i&HysK:£IPliAS serotype Ml for GAS 18 surface expression using a 
first anti-GAS 1 8 antiserum. 

FIGURE 78: FACS analysis of GAS serotype Ml for GAS 18 surface expression using a 
second anti-GAS 18 antiserum. 
5 FIGURE 79: FACS analysis of GAS serotype Ml for GAS 16 surface expression using a 

second anti-GAS 1 6 antisera. 

FIGURE 80: FACS analysis of GAS serotype M3 for spyM3_0098 surface expression. 

FIGURE 81: FACS analysis of GAS serotype M3 for spyM3_0100 surface expression. 

FIGURE 82: FACS analysis of GAS serotype M3 for spyM3_0102 surface expression. 
10 FIGURE 83: FACS analysis of GAS serotype M3 for spyM3_0104 surface expression. 

FIGURE 84: FACS analysis of GAS serotype M3 for spyM3_0106 surface expression. 

.FIGURE 85: FACS analysis of GAS serotype M12 for 19224134 surface expression. 

FIGURE 86: FACS analysis of GAS serotype M12 for 19224135 surface expression. 

FIGURE 87: FACS analysis of GAS serotype M12 for 19224137 surface expression. 
15 FIGURE 88: FACS analysis of GAS serotype M12 for 19224141 surface expression. 

FIGURE 89: Western blot analysis of GAS 15 expression on GAS Ml bacteria. 

FIGURE 90: Western blot analysis of GAS 15 expression using GAS 15 immune sera. 

FIGURE 91: Western blot analysis of GAS 15 expression using GAS 15 pre-immune sera. 

FIGURE 92: Western blot analysis of GAS 16 expression on GAS Ml bacteria. 
20 FIGURE 93: Western blot analysis of GAS 16 expression using GAS 16 immune sera. 

FIGURE 94: Western blot analysis of GAS 16 expression using GAS 16 pre-immune sera. 

FIGURE 95: Western blot analysis of GAS 18 on GAS Ml bacteria. 

FIGURE 96: Western blot analysis of GAS 18 using GAS 18 immune sera. 

FIGURE 97: Western blot analysis of GAS 18 using GAS 18 pre-immune sera. 
25 FIGURE 98: Western blot analysis of M6_Spy0159 expression on GAS bacteria. 

FIGURE 99: Western blot analysis of 19224135 expression on M12 GAS bacteria. 

FIGURE 100: Western blot analysis of 19224137 expression on M12 GAS bacteria. 

FIGURE 101 : Full length nucleotide sequence of an S. pneumoniae strain 670 AI. 

FIGURE 102: Western blot analysis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 

30 2580. 

FIGURE 103: Western blot analysis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 

2913. 

FIGURE 104: Western blot analysis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 

3280. 

35 FIGURE 105: Western blot analysis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 



3348. 



2719. 



FIGURE 106: Western blot analysis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 
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|p C TI@lffi©^^ Salisis of GAS 15, GAS 16, and GAS 18 in GAS Ml strain 

SF370. 

FIGURE 108: Western blot analysis of 19224135 and 19224137 in GAS M12 strain 2728. 
FIGURE 109: Western blot analysis of 19224139 in GAS M12 strain 2728 using antisera 
5 raised against SpyM3_0 102. 

FIGURE 110: Western blot analysis of M6_Spy0159 and M6_Spy0160 in GAS M6 strain 

2724. 

FIGURE 111: Western blot analysis of M6_Spy0 1 59 and M6__Spy0 1 60 in GAS M6 strain 

SF370. 

10 FIGURE 1 12: Western blot analysis of M6_Spyl60 in GAS M6 strain 2724. 

FIGURES 113-115: Electron micrographs of surface exposed GAS 15 on GAS Ml strain 

SF370. 

FIGURES 116-121: Electron micrographs of surface exposed GAS 16 on GAS Ml strain 

SF370. 

15 FIGURES 122-125: Electron micrographs of surface exposed GAS 18 on GAS Ml strain 

SF370 detected using anti-GAS 18 antisera. 

FIGURE 126: IEM image of a hyperoligomer on GAS Ml strain SF370 detected using anti- 
GAS 18 antisera. 

FIGURES 127-132: IEM images of oligomeric and hyperoligomeric structures containing 
20 M6_Spy0160 extending from the surface of GAS serotype M6 3650. 

FIGURE 133 A and B: Western blot analysis of L. lactis transformed to express GBS 80 with 
anti-GBS 80 antiserum. 

FIGURES 134: Western blot analyses of L. lactis transformed to express GBS AI-1 with 
anti-GBS 80 antiserum. 

25 FIGURE 135: Ponceau staining of same acrylamide gel as used in Figure 134. 

FIGURE 136A: Western blot analysis of sonicated pellets and supernatants of cultured L. 
lactis transformed to express GBS AI-1 polypeptides using anti-GBS 80 antiserum. 

FIGURE 136B: Polyacrylamide gel electrophoresis of sonicated pellets and supernatants of 
cultured L. lactis transformed to express GBS AI polypeptides. 
30 FIGURE 137: Depiction of an example S. pneumoniae AI locus. 

FIGURE 138: Schematic of primer hybridization sites within the S. pneumoniae AI locus of 
FIGURE 137. 

FIGURE 139 A: The set of amplicons produced from the S. pneumoniae strain TIGR4 AI 

locus. 

35 FIGURE 139B: Base pair lengths of amplicons produced from FIGURE 139A primers in S. 

pneumoniae strain TIGR4. 

FIGURE 140: CGH analysis of S. pneumoniae strains for the AI locus. 
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p C W^KMIt !3p^^Sq5M31<» alignment of polypeptides encoded by AI orf 2 in & 
pneumoniae Al-positive strains. 

FIGURE 142: Amino acid sequence alignment of polypeptides encoded by AI orf 3 in S, 
pneumoniae Al-positive strains. 
5 FIGURE 143: Amino acid sequence alignment of polypeptides encoded by AI orf 4 in 6*. 

pneumoniae Al-positive strains. 

FIGURE 144: Amino acid sequence alignment of polypeptides encoded by AI orf 5 in S. 
pneumoniae Al-positive strains. 

FIGURE 145: Amino acid sequence alignment of polypeptides encoded by AI orf 6 in S. 
10 pneumoniae Al-positive strains. 

FIGURE 146: Amino acid sequence alignment of polypeptides encoded by AI orf 7 in S. 
pneumoniae Al-positive strains. 

FIGURE 147: Amino acid sequence alignment of polypeptides encoded by AI orf 8 in S. 
pneumoniae Al-positive strains. 
15 FIGURE 148: Diagram comparing amino acid sequences of RrgA in S. pneumoniae strains. 

FIGURE 149: Amino acid sequence comparison of RrgB S. pneumoniae strains. 
FIGURE 150A: Sp0462 amino acid sequence. 

FIGURE 150B: Primers used to produce a clone encoding the Sp0462 polypeptide. 

FIGURE 151 A: Schematic depiction of recombinant Sp0462 polypeptide. 
20 FIGURE 151B: Schematic depiction of full-length Sp0462 polypeptide. 

FIGURE 152A: Western blot probed with serum obtained from S. pneumoniae~m.tcct.Qd 
patients for Sp0462. 

FIGURE 152B: Western blot probed with GBS 80 serum for Sp0462. 

FIGURE 153 A: Sp0463 amino acid sequence. 
25 FIGURE 153B: Primers used to produce a clone encoding the Sp0463 polypeptide. 

FIGURE 154 A: Schematic depiction of recombinant Sp0463 polypeptide. 

FIGURE 154B: Schematic depiction of full-length Sp0463 polypeptide. 

FIGURE 155: Western blot detection of recombinant Sp0463 polypeptide. 

FIGURE 156: Western blot detection of high molecular weight Sp0463 polymers. 
30 FIGURE 157A: Sp0464 amino acid sequence. 

FIGURE 157B: Primers used to produce a clone encoding the Sp0464 polypeptide. 

FIGURE 15 8 A: Schematic depiction of recombinant Sp0464 polypeptide. 

FIGURE 158B: Schematic depiction of full-length Sp0464 polypeptide. 

FIGURE 159: Western blot detection of recombinant Sp0464 polypeptide. 
35 FIGURE 160: Amplification products prepared for production of Sp0462, Sp0463, and 

Sp0464 clones. 

FIGURE 161 : Opsonic killing by anti-sera raised against L. lactis expressing GBS AI 
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P C IftSUdfiQS^ GAS adhesin islands GAS AM, GAS AI-2, GAS AI-3 

and GAS AI-4. 

FIGURES 163 A-D: Immunoblots of cell-wall fractions of GAS strains with antisera specific 
for LPXTG proteins of M6_ ISS3650 (A), M1_SF370 (B), M5JSS4883 (C) and M12_20010296 (D). . 
5 FIGURES 163 E-H: Immunoblots of cell-wall fractions of deletion mutants M1_SF370A128 

(E) M1JSF370A130 (F) Ml_SF370ASrtCl (G) and the Ml_128 deletion strain complemented with 
plasmid pAM::128 which contains the Ml_128 gene (H) with antisera specific for the pilin 
components of Ml SF370. 

FIGURES 163 I-N: Immunogold labelling and transmission electron microscopy of: T6 (I) 
10 and Cpa (J) in M6JLSS3650; Ml__128 in M1_SF370 (K) and deletion strain M1JSF370A128 (N); 
M5_orf80 in M5_ISS4883 (L); M12JEftLSL.A in M12__200 10296 (M). The strains used are indicated 
below the panels. Bars=200nm. 

FIGURE 164: Schematic representation of the FCT region from 7 GAS strains 

FIGURES 165 A-H: Flow cytometry of GAS bacteria treated or not with trypsin and stained 
15 with sera specific for the major pilus component. Preimmune staining; black lines, untreated bacteria; 
green lines and trypsin treated bacteria; blue lines. M6_ISS3650 stained with sera which recognize the 
M6 protein (A) or anti-M6_T6 (B), M1_SF370 stained with anti-Mi (C) or anti-Ml_128 (D), 
M5JSS4883 stained with anti-PrtF (E) or anti-M5_orf80 (F) and Ml 2_200 10296 with anti-M12 (G) 
or anti-EftLSL.A (H) 

20 FIGURES 166 A-C: Immunoblots of recombinant pilin components with polyvalent 

Lancefield T-typing sera. The recombinant proteins are shown above the blot and the sera pool used is 
shown below the blot. 

FIGURES 166 D-G: Immunoblots of pilin proteins with monovalent T-typing sera. The 
recombinant proteins are shown below the blot and the sera used above the blot. 
25 Figure 166 H and I Flow cytometry analysis of strain M1_SF370 (H) and the deletion strain 

M1_SF370A128 (I) with T-typing antisera pool T. 

FIGURE 167: Chart describing the number and type of sortase sequences identified within 
GAS AIs. 

FIGURE 168 A: Immunogold-electronmicroscopy of L. lactis lacking an expression 
30 construct for GBS AM using anti-GBS 80 antibodies. 

FIGURE 168 B and C: Immunogold-electronmicroscopy detects GBS 80 in oligomeric 
(pilus) structures on surface of L. lactis transformed to express GBS AI-1 

FIGURE 169: FACS analysis detects expression of GBS 80 and GBS 104 on the surface of 
L, lactis transformed to express GBS AI-1. 
35 FIGURE 170: Phase contrast microscopy and immuno-electronmicroscopy shows that 

expression of GBS AI-1 inZ,. lactis induces L. lactis aggregation. 

FIGURE 171 : Purification of GBS pili from L. lactis transformed to express GBS AI-1 . 
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PC IMMElffi^ of GAS M6 (AI-1), Ml (AI-2), and M12 (AI-4) adhesin 

islands and portions of the adhesin islands inserted in the pAM401 construct for expression in L. 
lactis. 

FIGURE 173 A-C: Western blot analysis showing assembly of GAS pili in L. lactis 
5 expressing GAS AI-2 (Ml) (A), GAS AI-4 (M12) (B), and GAS AI-1 (M6) (C). 

FIGURE 174: FACS analysis of GAS serotype M6 for M6_Spy0157 surface expression. 

FIGURE 175: FACS analysis of GAS serotype M12 for 19224139 surface expression. 

FIGURE 176 A-E: Immunogold electron microscopy using antibodies against M6_Spy0160 
detects pili on the surface of M6 strain 2724. 
10 FIGURE 176 F: Immunogold electron microscopy using antibodies against M6__Spy0159 

detects M6_Spy0159 surface expression on M6 strain 2724. 

FIGURE 177 A-C: Western blot analysis of Ml strain SF370 GAS bacteria individually 
deleted for Ml__130, SrtCl, or Ml__128 using anti-Ml_130 serum (A), anti-Ml_128 serum (B), and 
anti-Ml_126 serum (C). 

15 FIGURE 178 A-C: Immunogold electron microscopy using antibodies against Ml_128 to 

detect surface expression on wildtype strain SF370 bacteria (A), Ml_128 deleted SF370 bacteria (B), 
and SrtCl deleted SF370 bacteria (C). 

FIGURE 179 A-C: FACS analysis to detect expression of Ml_126 (A), Ml_128 (B), and 
M1130 (C) on the surface of wildtype SF370 GAS bacteria. 
20 FIGURE 179 D-F: FACS analysis to detect expression of Ml_126 (D), Ml_128 (E), and 

M1130 (F) on the surface of Ml__128 deleted SF370 GAS bacteria. 

FIGURE 179 G-I: FACS analysis to detect expression of Ml_126 (G), Ml_128 (H), and 
Ml_130 (I) on the surface of SrtCl deleted SF370 GAS bacteria. 

FIGURE 180 A and B: FACS analysis of wildtype (A) and LepA deletion mutant (B) strains 
25 of SF370 bacteria for Ml surface expression. 

FIGURE 181: Western blot analysis detects high molecular weight polymers in S. 
pneumoniae TIGR4 using anti-RrgB antisera. 

FIGURE 1 82: Detection of high molecular weight polymers in S. pnuemoniae rlrA positive 

strains. 

30 FIGURE 183: Detection of high molecular weight polymers in S. pneumoniae TIGR4 by 

silver staining and Western blot analysis using anti-RrgB antisera. 

FIGURE 184: Deletion of S. pneumoniae TIGR4 adhesin island sequences interferes with the 
ability of S. pneumoniae to adhere to A549 alveolar cells. 

FIGURE 185: Negative staining of S. pneumoniae strain TIGR4 showing abundant pili on 
35 the bacterial surface. 

FIGURE 186: Negative staining of strain TIGR4 deleted for rrgA-srtD adhesin island 
sequences showing no pili on the bacterial surface 
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IF 1 ' C llGtJ&ffi the TIGR4 mgrA mutant showing abundant pili on the 

bacterial surface. 

FIGURE 188: Negative staining of the negative control TIGR4 mgrA mutant deleted for 
adhesin island sequences rrgA-srtD showing no pili on the bacterial surface, 
5 FIGURE 189: Immuno-gold labelling of S. pneumoniae strain TIGR4 grown on blood agar 

solid medium using a-RrgB (5nm) and ct-RrgC (lOnrn). Bar represents 200nm. 

FIGURE 190 A and B: Detection of expression and purification of S. pneumoniae RrgA 
protein by SDS-PAGE (A) and Western blot analysis (B). 

FIGURE 191: Detection of RrgB by antibodies produced in mice. 
10 FIGURE 192: Detection of RrgC by antibodies produced in mice. 

FIGURE 193: Purification of S. pneumoniae TIGR 4 piii by a cultivation and digestion 
method and detection of the purified TIGR4 pili. 

FIGURE 194: Purification of S. pneumoniae TIGR 4 pili by a sucrose gradient centrifugation 
method and detection of the purified TIGR4 pili. 
15 FIGURE 195: Purification of S. pneumoniae TIGR 4 pili by a gel filtration method and 

detection of the purified TIGR4 pili. 

FIGURE 196: Alignment of full length S. pneumoniae adhesin island sequences from ten S. 
pneumoniae strains. 

FIGURE 197 A: Schematic of GBS AI-1 coding sequences. 
20 FIGURE 197 B: Nucleotide sequence of intergenic region between AraC and GBS 80 (SEQ 

ID NO: 273. 

FIGURE 197 C: FACS analysis results for GBS 80 expression in GBS strains having 
different length polyA tracts in the intergenic region between AraC and GBS 80. 

FIGURE 198: Table comparing the percent identity of surface proteins encoded by a 
25 serotype M6 (harbouring a GAS AI-1) adhesin island relative to other GAS serotypes harbouring an 
adhesin island. 

FIGURE 199: Table comparing the percent identity of surface proteins encoded by a 
serotype Ml (harbouring a GAS AI-2) adhesin island relative to other GAS serotypes harbouring an 
adhesin island. 

30 FIGURE 200: Table comparing the percent identity of surface proteins encoded by serotypes 

M3, M18, M5, and M49 (harbouring GAS AI-3) adhesin islands relative to other GAS serotypes 
harbouring an adhesin island. 

FIGURE 201: Table comparing the percent identity of surface proteins encoded by a 
serotype M12 (harbouring a GAS AI-1) adhesin island- relative to other GAS serotypes harbouring an 
35 adhesin island. 

FIGURE 202: GBS 80 recombinant protein does not bind to epithelial cells. 
FIGURE 203: Deletion of GBS 80 protein does not affect the ability of GBS to adhere and 
invade ME 180 cervical epithelial cells. 
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P C TFlGttffiMK matrix proteins. 

FIGURE 205: Deletion of GBS 104 protein, but not GBS 80, reduces the capacity of GBS to 
invade J774 macrophage-like cells 

FIGURE 206: GBS 104 knockout mutant strains of bacteria translocate through an epithelial 
monolayer less efficiently that the isogenic wild type strain. 

FIGURE 207: GBS 80 knockout mutant strains of bacteria partially lose the ability to 
translocate through an epithelial monolayer. 

FIGURE 208: GBS adherence to HUVEC endothelial cells. 

FIGURE 209: Strain growth rate of wildtype, GBS 80-deleted, or GBS 104 deleted COH1 

GBS. 

FIGURE 210: Binding of recombinant GBS 104 protein to epithelial cells by FACS analysis. 

FIGURE 211: Deletion of GBS 104 proteinin the GBS strain COH1 reduces the ability of 
GBS to adhere to ME180 cervical epithelial cells. 

FIGURE 212: COH1 strain GBS overexpressing GBS 80 protein has an impaired capacity to 
translocate through an epithelial monolayer. 

FIGURE 213: Scanning electron microscopy shows that overexpression of GBS 80 protein 
on COH1 strain GBS enhances the capacity of the COH1 bacteria to form microcolonies on epithelial 
cells. 

FIGURE 214: Confocal imaging shows that overexpression of GBS 80 proteins on COH1 
strain GBS enhances the capacity of the COH1 bacteria to form microcolonies on epithelial cells. 

FIGURE 215: Detection of GBS 59 on the surface of GBS strain 515 by immuno-electron 
microscopy. 

FIGURE 216: Detection of GBS 67 on the surface of GBS strain 515 by immuno-electron 
microscopy. 

FIGURE 217: GBS 67 binds to fibronectin. 

FIGURE 218: Western blot analysis shows that deletion of both GBS AI-2 sortase genes 
abolishes assembly of the pilus. 

FIGURE 219: FACS analysis shows that deletion of both GBS AI-2 sortase genes abolishes 
assembly of the pilus. 

FIGURE 220 A-C: Western blot analysis shows that GBS 59, GBS 67, and GBS 150 form 
high molecular weight complexes. 

FIGURE 221 A-C: Western blot analysis shows that GBS 59 is required for polymer 
formation of GBS 67 and GBS 150. 

FIGURE 222: FACS analysis shows that GBS 59 is required for surface exposure of GBS 67. 

FIGURE 223: Summary Western blots for detection of GBS 59, GBS 67, or GBS 150 in 
GBS 515 and GBS 515 mutant strain. 

FIGURE 224: Description of GBS 59 allelic variants. 
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f >C TP I<ftM5 ^zBScrfeiSsy S^Sfec only against a strain of GBS expressing a homologous 
GBS 59. 

FIGURE 226 A and B: Results of FACS analysis for surface expression of GBS 59 using 
antibodies specific for different GBS 59isoforms. 



104, GBS 322, GBS 67, and GBS 59 on 41 various strains of GBS bacteria. 

FIGURE 228: Results of FACS analysis for surface expression of GBS 80, GBS 104, GBS 
322, GBS 67, and GBS 59 on 41 strains of GBS bacteria obtained from the CDC. 

FIGURE 229: Expected immunogenicity coverage of different combinations of GBS 80, 
10 GBS 104, GBS 322, GBS 67, and GBS 59 across strains of GBS bacteria. 

FIGURE 230: GBS 59 opsonophagocytic activity is comparable to that of a mixture of GBS 
80, GBS 104, GBS 322 and GBS 67. 

FIGURE 23 1 A-C: Schematic presentation of example hybrid GBS AIs. 
FIGURE 232: Schematic presentation of an example hybrid GBS AI. 
15 FIGURE 233 A and B: Western blot and FACS analysis detect expression of GBS 80 and 

GBS 67 on the surface of L. lactis transformed with a hybrid GBS AI. 
FIGURE 234 A-E Hybrid GBS AI cloning strategy. 

FIGURE 235: High magnification of S. pneumoniae strain TIGR4 pili double labeled with a- 
RrgB (5nm) and a-RrgC (lOnrri). Bar represents lOOnm. 
20 FIGURE 236: Immuno-gold labeling of the S. pneumoniae TIGR4 rrgA-srtD deletion mutant 

with no visible pili on the surface detectable by a- RrgB- and a-RrgC. Bar represents 200nm. 

FIGURE 237: Variability in GBS 67 amino acid sequences between strains 2603 and H36B. 
FIGURE 238: Strain variability in GBS 67 amino acid sequences of allele I (2603). 
FIGURE 239: Stran variability in GBS 67 amino acid sequence of allele II (H36B). 



5 



FIGURE 227 A and B: Results of FACS analysis for surface expression of GBS 80, GBS 
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BRIEF DESCRIPTION OF THE TABLES 



35 



30 



TABLE 1 : Active Maternal Immunization Assay for fragments of GBS 80 
TABLE 2: Passive Maternal Immunization Assay for fragments of GBS 80 
TABLE 3: Lethal dose 50% of AM mutants from GBS strain isolate 2603. 
TABLE 4: GAS AI-1 sequences from M6 isolate (MGAS 10394). 
TABLE 5: GAS AI-2 sequences from Ml isolate (SF370). 
TABLE 6: GAS AI-3 sequences from M3 isolate (MGAS315). 
TABLE 7: GAS AI-3 sequences from M3 isolate (SSI-1). 
TABLE 8: GAS AI-3 sequences from M18 isolate (MGAS8232). 
TABLE 9: S. pneumoniae AI sequences from TIGR4 sequence. 
TABLE 10: GAS AI-3 sequences from M5 isolate (Manfredo). 
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p C from M12 isolate (A735). 

TABLE 12: Conservation of GBS 80 and GBS 104 amino acid sequences. 

TABLE 13: Conservation of GBS 322 and GBS 276 amino acid sequences. 

TABLE 14: Active maternal immunization assay for. a combination of fragments from GBS 
322, GBS 80, GBS 104, and GBS 67. 

TABLE 15: Antigen surface exposure of GBS 80, GBS 322, GBS 104, and GBS 67. 

TABLE 16: Active maternal immunization assay for each of GBS 80 and GBS 322 antigens. 

TABLE 17: Active maternal immunization assay for GBS 59. 

TABLE 18: Summary of FACS values for surface expression of spyM6__0159. 

TABLE 19: Summary of FACS values for surface expression of spyM6J3160. 

TABLE 20: Summary of FACS values for surface expression of GAS 15. 

TABLE 21: Summary of FACS values for surface expression of GAS 16. 

TABLE 22: Summary of FACS values for surface expression of GAS 16 using a second 
antisera. 

TABLE 23: Smrimary of FACS values for surface expression of GAS 18. 

TABLE 24: Summary of FACS values for surface expression of GAS 18 using a second 

antisera. 

TABLE 25: Summary of FACS values for surface expression of SpyM3__0098. 
TABLE 26: Summary of FACS values for surface expression of SpyM3_0100. 
TABLE 27: Summary of FACS values for surface expression of SpyM3__0102 in M3 
serotypes. 

TABLE 28: Summary of FACS values for surface expression of SpyM3_0102 in M6 
serotypes. 

TABLE 29: Summary of FACS values for surface expression of SpyM3_0104 in M3 
serotypes. 

TABLE 30: Summary of FACS values for surface expression of SpyM3_0104 in an Ml 2 
serotype. 

TABLE 3 1 : Summary of FACS values for surface expression of SPs_0106 in M3 serotypes. 
TABLE 32: Summary of FACS values for surface expression of SPs_0106 in an M 12 
serotype. 

TABLE 33: Summary of FACS values for surface expression of 19224134 in an M12 
serotype. 

TABLE 34: Summary of FACS values for surface expression of 19224134 in M6 serotypes. 
TABLE 35: Summary of FACS values for surface expression of 19224135 in an M12 
serotype. 

TABLE 36: Summary of FACS values for surface expression of 19224137 in an M12 
serotype. 



-33- 



WO 2006/078318 PCT/US2005/027239 

P it::;: for surface expression of 19224141 in an M 12 

serotype. 

TABLE 38: S. pneumoniae strain 670 AI sequences. 

TABLE 39: Pecent identity comparison of S. pneumoniae strains AI sequences. 
5 TABLE 40: FACS analysis of L. lactis and GBS bacteria strains expressing GBS ALL 

TABLE 41 : Sequences of primers used to amplify AI locus. 

TABLE 42: Conservation of amino acid sequences encoded by the S. pneumoniae AI locus. 
TABLE 43: Protection of Mice Immunized with L. lactis expressing GBS AI-1. 
TABLE 44: GAS AI-3 sequences from M49 isolate (591). 
10 TABLE 45: Comparison of Sequences Between the Four GAS AIs. 

TABLE 46: Antibody Responses against GBS 80 in Serum of Mice Immunized with L. lactis 
Expressing GBS AI-1 

TABLE 47: Anti-GBS 80 IgA Antibodies Detected in Mouse Tissues Following 
Immunization withZ. lactis Expressing GBS AI-1 
15 TABLE 48: GBS 67 Protects Mice in an Immunization Assay 

TABLE 49: Exposure Levels of GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 on GBS 

Strains 

TABLE 50: High Levels of Surface Protein Expression on GBS Serotypes 

TABLE 5 1 : Further Protection of Mice Immunized with L. lactis expressing GB S AI- 1 

20 

DETAILED DESCRIPTION OF THE INVENTION 

The practice of the present invention will employ, unless otherwise indicated, conventional 
methods of chemistry, biochemistry, molecular biology, immunology and pharmacology, within the 
skill of the art. Such techniques are explained fully in the literature. See, e.g., Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., 19th Edition (1995); Methods In 
Enzymology (S. Colowick and N. Kaplan, eds., Academic Press, Inc.); and Handbook of Experimental 
Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell, eds., 1986, Blackwell Scientific 
Publications); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); 
Handbook of Surface and Colloidal Chemistiy (Birdi, K.S. ed., CRC Press, 1997); Short Protocols in 
Molecular Biology, 4th ed. (Ausubel et al. eds., 1999, John Wiley & Sons); Molecular Biology 
Techniques: An Intensive Laboratory Course, (Ream et al., eds., 1998, Academic Press); PCR 
(Introduction to Bio techniques Series), 2nd ed. (Newton & Graham eds., 1997, Springer Verlag); . 
Peters and Dalrymple, Fields Virology (2d ed), Fields et al. (eds.), B.N. Raven Press, New York, NY. 

All publications, patents and patent applications cited herein, are hereby incorporated by 
reference in their entireties. 

As used herein, an "Adhesin Island" or "AI" refers to a series of open reading frames within a 

bacterial genome, such as the genome for Group A or Group B Streptococcus or other gram positive 

bacteria, that encodes for a collection of surface proteins and sortases. An Adhesin Island may 
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eSbfe I!r4rM<^ Jompiiil^g at least one surface protein. The Adhesin Island may 

encode at least one surface protein. Alternatively, an Adhesin Island may encode for at least two 
surface proteins and at least one sortase. Preferably, an Adhesin Island encodes for at least three 
surface proteins and at least two sortases. One or more of the surface proteins may include an 
LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. One or more AI 
surface proteins may participate in the formation of a pilus structure on the surface of the gram 
positive bacteria. 

Adhesin Islands of the invention preferably include a divergently transcribed transcriptional 
regulator (i.e., the transcriptional regulator is located near or adjacent to the AI protein open reading 
frames, but it transcribed in the opposite direction). The transcriptional regulator may regulate the 
expression of the AI operon. 
GBS Adhesin Island 1 

As discussed above, Applicants have identified a new adhesin island, "Adhesin Island 1", 
"AM", or "GBS AM", within the genomes of several Group B Streptococcus serotypes and isolates. 
AM comprises a series of approximately five open reading frames encoding for a collection of amino 
acid sequences comprising surface proteins and sortases ("AM proteins"). Specifically, AM 
includes open reading frames encoding for two or more (Le., 2, 3, 4 or 5) of GBS 80, GBS 104, GBS 
52, SAG0647 and SAG0648. One or more of the AM open reading frame polynucleotide sequences 
may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORP. 
Alternatively, one or more of the AM open reading frames may be replaced by a sequence having 
sequence homology to the replaced ORF. 

A schematic of AM is presented in Figure 1. AM typically resides on an approximately 16.1 
kb transposon-like element frequently inserted into the open reading frame for trmA. One or more of 
the AM surface protein sequences typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 
122)) motif or other sortase substrate motif. The AI surface proteins of the invention may affect the 
ability of the GBS bacteria to adhere to and invade epithelial cells. AI surface proteins may also 
affect the ability of GBS to translocate through an epithelial cell layer. Preferably, one or more AI 
surface proteins are capable of binding to or otherwise associating with an epithelial cell surface. AI 
surface proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The AM sortase proteins are predicted to be involved in the secretion and anchoring of the 
LPXTG containing surface proteins. AI-1 may encode for at least one surface protein. Alternatively, 
AM may encode for at least two surface exposed proteins and at least one sortase. Preferably, AI-1 
encodes for at least three surface exposed proteins and at least two sortases. The AM protein 
preferably includes GBS 80 or a fragment thereof or a sequence having sequence identity thereto. 

As used herein, an LPXTG motif represents an amino acid sequence comprising at least five 
amino acid residues. Preferably, the motif includes a leucine (L) in the first amino acid position, a 
proline (P) in the second amino acid position, a threonine (T) in the fourth amino acid position and a 

glycine (G) in the fifth amino acid position. The third position, represented by X, may be occupied by 
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a^S^^^WSM&vS^^l^SSt 3is occupied by lysine (K), Glutamate (E), Asparagine (N), 
Glutamine (Q) or Alanine (A). Preferably, the X position is occupied by lysine (K). In some 
embodiments, one of the assigned LPXTG amino acid positions is replaced with another amino acid. 
Preferably, such replacements comprise conservative amino acid replacements, meaning that the 
replaced amino acid residue has similar physiological properties to the removed amino acid residue. 
Genetically encoded amino acids may be divided into four families based on physiological properties: 
(1) acidic (asparatate and glutamate), (2) basic (lysine, arginine, histitidine), (3) non-polar (alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophane) and (4) uncharged polar 
(glycine, asparagines, glutamine, cysteine, serine, threonine, and tyrosine). Phenylalanine, tryptophan 
and tyrosine are sometimes classified jointly as aromatic amino acids. For example, it is reasonably 
predictable that an isolated replacement of a leucine with an isoleucine or valine, an asparate with a 
glutamate, a threonine with a serine, or a similar conservative replacement of an amino acid with a 
structurally related amino acid will not have a major effect on the biological activity. 

The first amino acid position of the LPXTG motif may be replaced with another amino acid 
residue. Preferably, the first amino acid residue (leucine) is replaced with an alanine (A), valine (V), 
isoleucine (I), proline (P), phenylalanine (F), methionine (M), glutamic acid (E), glutamine (Q), or 
tryptophan (Y) residue. In one preferred embodiment, the first amino acid residue is replaced with an 
isoleucine (I). 

The second amino acid residue of the LPXTG motif may be replaced with another amino acid 
residue. Preferably, the second amino acid residue praline (P) is replaced with a valine (V) residue. 

The fourth amino acid residue of the LPXTG motif may be replaced with another amino acid 
residue. Preferably, the fourth amino acid residue (threonine) is replaced with a serine (S) or an 
alanine (A). 

In general, an LPXTG motif may be represented by the amino acid sequence XXXXG, in 
which X at amino acid position 1 is an L, a V, an E, an I, an F, or a Q; X at amino acid position 2 is a 
P if X at amino acid position 1 is an L, an I, or an F; X at amino acid position 2 is a V if X at amino 
acid position 1 is a E or a Q; X at amino acid position 2 is a V or a P if X at amino acid position 1 is a 
V; X at amino acid position 3 is any amino acid residue; X at amino acid position 4 is a T if X at 
amino acid position 1 is a V, E, I, F, or Q; and X at amino acid position 4 is a T, S, or A if X at amino 
acid position 1 is an L. 

Generally, the LPXTG motif of a GBS AI protein may be represented by the amino acid 
sequence XPXTG, in which X at amino acid position 1 is L, I, or F, and X at amino acid position 3 is 
any amino acid residue. Specific examples of LPXTG motifs in GBS AI proteins may include 
LPXTG (SEQ ID NO: 122) or IPXTG (SEQ ID NO: 133). 

As discussed further below, the threonine in the fourth amino acid position of the LPXTG 
motif may be involved in the formation of a bond between the LPXTG containing protein and a cell 
wall precursor. Accordingly, in preferred LPXTG motifs, the threonine in the fourth amino acid 
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riBitM acid or, if the threonine is replaced, the replacement 

amino acid is preferably a conservative amino acid replacement, such as serine. 

Instead of an LPXTG motif, the AI surface proteins of the invention may contain alternative 
sortase substrate motifs such as NPQTN (SEQ ID NO: 142), NPKTN (SEQ ID NO: 168), NPQTG 
(SEQ ID NO: 169), NPKTG (SEQ ID NO: 170), XPXTGG (SEQ ID NO: 143), LPXTAX (SEQ ID 
NO: 144), or LAXTGX (SEQ ID NO: 145). (Similar conservative amino acid substitutions can also 
be made to these membrane motifs). 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al„ 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

The AI surface proteins may be polymerized into pili by sortase-catalysed transpeptidation. 
(See Figure 44.) Cleavage of AI surface proteins by sortase between the threonine and glycine 
residues of an LPXTG motif yields a thioester-linked acyl intermediate of sortase. Many AI surface 
proteins include a pilin motif amino acid sequence which interacts with the sortase and LPXTG amino 
acid sequence. The first lysine residue in a pilin motif can serve as an amino group acceptor of the 
cleaved LPXTG motif and thereby provide a covalent linkage between AI subunits to form pili. For 
example, the pilin motif can make a nucleophilic attack on the acyl enzyme providing a covalent 
linkage between AI subunits to form pili and regenerate the sortase enzyme. Examples of pilin motifs 
may include ((YPKN(X 10 )K; SEQ ID NO: 146), (YPKN(X9)K; SEQ ID NO: 147), (YPK(X 7 )K; SEQ 
ID NO: 148), (YPK(X n )K; SEQ ID NO: 149), or (PKN(X 9 )K; SEQ ID NO: 150)). Preferably, the AI 
surface proteins of the invention include a pilin motif amino acid sequence. 

Typically, AI surface proteins of the invention will contain an N-terminal leader or secretion 
signal to facilitate translocation of the surface protein across the bacterial membrane. 

Group B Streptococci are known to colonize the urinary tract, the lower gastrointestinal tract 
and the upper respiratory tract in humans. Electron micrograph images of GBS infection of a cervical 
epithelial cell line (ME180) are presented in Figure 25. As shown in these images, the bacteria 
closely associate with tight junctions between the cells and appear to cross the monolayer by a 
paracellular route. Similar paraceliular invasion of ME1 80 cells is also shown in the contrast images 
in Figure 26. The AI surface proteins of the invention may effect the ability of the GBS bacteria to 
adhere to and invade epithelial cells. AI surface proteins may also affect the ability of GBS to 
translocate through an epithelial cell layer. Preferably, one or more AI surface proteins are capable of 
binding to or otherwise associating with an epithelial cell surface. 

Applicants have discovered that AM surface protein GBS 104 can bind epithelial cells such 

as ME 180 human cervical cells, A549 human lung cells and Caco2 human intestinal cells (See 
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FffiU&lVaiW GBS 104 sequence in a GBS strain reduces the capacity 

of GBS to adhere to ME180 cervical epithelial cells. (See Figures 30 and 211). Deletion of GBS 104 

also reduces the capacity of GBS to invade J774 macrophage-like cells. (See Figures 32 and 205). 

Deletion of GBS 104 also causes GBS to translocate through epithelial monolayers less efficiently. 
5 See Figure 206. GBS 104 protein therefore appears to bind to ME 180 epithelial cells and to have a 

role in adhesion to epithelial cells and macrophage cell lines. 

Similar to the GBS bacteria that are deletion mutants for GBS 104, GBS 80 knockout mutant 

strains also partially lose the ability to translocate through an epithelial monolayer. See Figure 207. 

Deletion of either GBS 80 or GBS 104 in COH1 cells. diminishes adherence to HUVEC endothelial 
10 cells. See Figure 208. Deletion of GBS 80 or GBS 104 in COH1 does not, however, affect growth of 

COH1 either with ME180 cells or in incubation medium (IM). See Figure 209. Both GBS 80 and 

GBS 104, therefore, appear to be involved in translocation of GBS through epithelial cells. 

GBS 80 does not appear to bind to epithelial cells. Incubation of epithelial cells in the 

presence of GBS 80 protein followed by FACS analysis using an anti-GBS 80 polyclonal antibody did 
15 not detect GBS 80 binding to the epithelial cells. See Figure 202. Furthermore, deletion of GBS 80 

protein does not affect the ability of GBS to adhere and invade ME180 cervical epithelial cells. See 

Figure 203 

Preferably, one or more of the surface proteins may bind to one or more extracellular matrix 
(ECM) binding proteins, such as fibrinogen, fibronectin, or collagen. As shown in Figures 5 and 204, 

20 and Example 1, GBS 80, one of the AI-1 surface proteins, can bind to the extracellular matrix binding 
proteins fibronectin and fibrinogen. While GBS 80 protein apparently does not bind to certain 
epithelial cells or affect the capacity of a GBS bacteria to adhere to or invade cervical epithelial cells 
(See Figures 27 and'28), removal of GBS 80 from a wild type strain decreases the ability of that strain 
to translocate through an epithelial cell layer (see Figure 31). 

25 GBS 80 may also be involved in formation of biofilms. COH1 bacteria overexpressing GBS 

80 protein have an impaired ability to translocate through an epithelial monolayer. See Figure 212. 
These COH1 bacteria overexpressing GBS 80 form microcolonies on epithelial cells. See Figures 213 
and 214. These microcolonies may be the initiation of biofilm development. 

Al Surface proteins may also demonstrate functional homology to previously identified 

30 adhesion proteins or extracellular matrix (ECM) binding proteins. For example, GBS 80, a surface 
protein in AI-1, exhibits some functional homology to FimA, a major fimbrial subunit of a Gram 
positive bacteria A. naeslundii. FimA is thought to be involved in binding salivary proteins and may 
be a component in a fimbrae on the surface of A, naeslundii. See Yeung et ah (1997) Infection 8c 
Immunity 65:2629-2639; Yeunge et al (1998) J. Bacteriol 66: 1482-1491; Yeung et ah (1988) J. 

35 Bacteriol 170:3803 - 3809; and Li et ah (2001) Infection & Immunity 69:7224-7233. 

A similar functional homology has also been identified between GBS 80 and proteins 

involved in pili formation in the Gram positive bacteria Corynebacterium diphtheriae (SpaA, SpaD, 

and SpaH). See, Ton-That et ah (2003) Molecular Microbiology 50(4): 1429-1438 and Ton-That et ah 
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ibllgy l §Si[ ; ijl&9-261. The C. diphtheriae proteins all included a pilin motif 




of WxxxV xVYPK (SEQ ID NO: 151; where x indicates a varying amino acid residue). The lysine 
(K) residue is particularly conserved in the C. diphtheriae pilus proteins and is thought to be involved 
in sortase catalized oligomerization of the subunits involved in the C diphtheriae pilus structure. 
5 (The C. diphtheriae pilin subunit SpaA is thought to occur by sortase-catalyzed amide bond cross- 
linking of adjacent pilin subunits. As the thioester-linked acyl intermediate of sortase requires 
nucleophilic attack for release, the conserved lysine within the SpaA pilin motif might function as an 
amino group acceptor of cleaved sorting signals, thereby providing for covalent linkages of the G 
diphtheria pilin subunits. See Figure 6(d) of Ton-That et al., Molecular Microbiology (2003) 

10 50(4): 1429-1438.) 

In addition, an "E box" comprising a conserved glutamic acid residue has also been identified 
in the C. diphtheria pilin associated proteins as important in C. diphtheria pilin assembly. The E box 
motif generally comprises YxLxETxAPxGY (SEQ ID NO: 152; where x indicates a varying amino 
acid residue). In particular, the conserved glutamic acid residue within the E box is thought necessary 

15 for C. diphtheria pilus formation. 

Preferably, the AI-1 polypeptides of the immunogenic compositions comprise an E box motif. 
Some examples of E box motifs in the AI-1 polypeptides may include the amino acid sequences 
YxLxExxxxxGY (SEQ ID NO: 1 53), YxLxExxxPxGY (SEQ ID NO: 1 54), or YxLxETxAPxGY 
(SEQ ID NO: 152). Specifically, the E box motif of the polypeptides may comprise the amino acid 

20 sequences YKLKETKAPEGY (SEQ ID NO: 155), YVLKEIETQSGY (SEQ ID NO: 156), or 
YKLYEIS SPDG Y (SEQ ID NO: 157). 

As discussed in more detail below, a pilin motif containing a conserved lysine residue and an 
E box motif containing a conserved glutamic acid residue have both been identified in GBS 80. 



25 surface of streptococci, (see, e.g., Ton-That et al., Molecular Microbiology (2003) 50(4): 1429 - 
1438), these structures have not been previously visible in negative stain (non-specific) electron 



micrographs of GBS serotype III, strain isolate COH1 with a plasmid insert to facilitate the 

overexpression of GBS 80. This EM photo was produced with a standard negative stain - no pilus 
30 structures are distinguishable. In addition, the use of such Al surface proteins in immunogenic 

compositions for the treatment or prevention of infection against a Gram positive bacteria has not 

been previously described. 

Surprisingly, Applicants have now identified the presence of GBS 80 in surface exposed pilus 

formations visible in electron micrographs. These structures are only visible when the electron 
35 micrographs are specifically stained against an Al surface protein such as GBS 80. Examples of these 

electron micrographs are shown in Figures 11,16 and 17, which reveal the presence of pilus 

structures in wild type COH1 Streptococcus agalactiae. Other examples of these electron 



While previous publications have speculated that pilus-like structures might be formed on the 



micrographs, throwing such speculations into doubt. For example, Figure 34 presents electron 
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njgiiEgrj&pfas yeiliMMiln^iiS'Sl, iSkfih reveals that GBS 80 is associated with pili in a wild type 
clinical isolate of S. agalactiae, JM9030013. (See figure 49.) 

Applicants have also constructed mutant GBS strains containing a plasmid comprising the 
GBS 80 sequence resulting in the overexpression of GBS 80 within this mutant. The electron 
5 micrographs of Figures 13-15 are also stained against GBS 80 and reveal long, oligomeric structures 
containing GBS 80 which appear to cover portions of the surface of the bacteria and stretch far out 
into the supernatant. 

In some instances, the formation of pili structures on GBS appears to be correlated to surface 
- expression of GBS 80. Figure 61 provides FAC analysis of GBS 80 surface levels on bacterial strains 

10 COH1 and JM9130013 using an anti-GBS 80 antisera. Immunogold electron microscopy of the 

COH1 and JM9130013 bacteria using anti-GBS 80 antisera demonstrates that JM9130013 bacteria, 
which have higher values for GBS 80 surface expression, also form longer pili structures. 

The surface exposure of GBS 80 on GBS is generally not capsule-dependent. Figure 62 
provides FACS analysis of capsulated and uncapsulated GBS analyzed with anti-GBS 80 and anti- 

15 GBS 322 antibodies. Surface exposure of GBS 80, unlike GBS 322, is not capsule dependent. 

An Adhesin Island surface protein, such as GBS 80 appears to be required for pili formation, 
as well as an Adhesin Island sortase. Pili are formed in Cohl bacterial clones that overexpress GBS 
80, but lack GBS 104, or one of the AI-1 sortases sag0647 or sag0648. However, pili are not formed 
in Cohl bacterial clones that overexpress GBS 80 and lack both sag0647 and sag0648. Thus, for 

20 example, it appears that at least GBS 80 and a sortase, sag0647 or sag0648, may be necessary for pili 
formation. (See Figure 48.) Overexpression of GBS 80 in GBS strain 515, which lacks an AI-1, also 
assembles GBS 80 into pili. GBS strain 515 contains an AI-2, and thus AI-2 sortases. The AI-2 
sortases in GBS strain 515 apparently polymerize GBS 80 into pili. (See Figure 42.) Overexpression 
of GBS 80 in GBS strain 515 cell knocked out for GBS 67 expression also apparently polymerizes 

25 GBS 80 into pili. (See Figure 72.) 

While GBS 80 appears to be required for GBS AI-1 pili formation, GBS 104 and sortase 
SAG0648 appears to be important for efficent AI-1 pili assembly. For example, high-molecular 
structures are not assembled in isogenic COH1 strains which lack expression of GBS 80 due to gene 
disruption and are less efficiently assembled in isogenic COH1 strains which lack the expression of 

30 GBS 104 (see Figure 41). This GBS strain comprises high molecular weight pili structures composed 
of covalently linked GBS 80 and GBS 104 subunits. In addition, deleting SAG0648 in COH1 
bacteria interferes with assembly of some of the high molecular weight pili structures. Thus, 
indicating that SAG0648 plays a role in assembly of these pilin species. (See Figure 41). 
EM photos confirm the involvement of AI surface protein GBS 104 within the 

35 hyperoligomeric structures of a GBS strain adapted for increased GBS 80 expression. (See Figures 34 
- 41 and Example 6). In a wild type serotype VIII GBS strain, strain JM9030013, IEM identifies 
GBS 104 as forming clusters on the bacterial surface. (See Figure 50.) 
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P C IxBS fellBisOaj^ of the GBS pili. Immunoblots using an anti-GBS 80 

antisera on total cell extracts of Cohl and a GBS 52 null mutant Cohl reveal a shift in detected 

proteins in the Cohl wild type strain relative to the GBS 52 null mutant Cohl strain. The shifted 

proteins were also detected in the wild type Cohl bacteria with an anti-GBS 52 antisera, indicating 

5 that the GBS 52 may be present in the pilus. (See Figure 45.) 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 

structures comprising an AI surface protein such as GBS 80. The oligomeric, pilus-like structure may 

comprise numerous units of AI surface protein. Preferably, the oligomeric, pilus-like structures 

comprise two or more AI surface proteins. Still more preferably, the oligomeric, pilus-like structure 

10 comprises a hyper-oligomeric pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 

10, 1 1, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) 

oligomeric subunits, wherein each subunit comprises an AI surface protein or a fragment thereof. The 

oligomeric subunits may be covalently associated via a conserved lysine within a pilin motif. The 

oligomeric subunits may be covalently associated via an LPXTG motif, preferably, via the threonine 

1 5 amino acid residue. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 

structures of the invention will preferably include one or both of a pilin motif comprising a conserved 

lysine residue and an E box motif comprising a conserved glutamic acid residue. 

More than one AI surface protein may be present in the oligomeric, pilus-like structures of the 

20 invention. For example, GBS 80 and GBS 104 may be incorporated into an oligomeric structure. 

Alternatively, GBS 80 and GBS 52 may be incorporated into an oligomeric structure, or GBS 80, 

GBS 104 and GBS 52 may be incorporated into an oligomeric structure. 

In another embodiment, the invention includes compositions comprising two or more AI 

surface proteins. The composition may include surface proteins from the same adhesin island. For 

25 example, the composition may include two or more GBS AH surface proteins, such as GBS 80, GBS 

104 and GBS 52. The surface proteins may be isolated from Gram positve bacteria or they may be 

produced recombinantly. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 

invention. In one embodiment, the invention comprises a GBS Adhesin Island protein in oligomeric 

30 form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises a 

composition comprising one or more GBS Adhesin Island 1 ("AI-1") proteins and one or more GBS 

Adhesin Island 2 ("AI-2") proteins, wherein one or more of the Adhesin Island proteins is in the form 

of an oligomer, preferably in a hyperoligomeric form. 

The oligomeric, pilus-like structures of the invention may be combined with one or more 

35 additional GBS proteins. In one embodiment, the oligomeric, pilus-like structures comprise one or 

more AI surface proteins in combination with a second GBS protein. The second GBS protein may 

be a known GBS antigen, such as GBS 322 (commonly referred to as "sip") or GBS 276. Nucleotide 

and amino acid sequences of GBS 322 sequenced from serotype V isolated strain 2603 V/R are set 
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fiM^mWd'MEiWii SE^lE iiiPSi SEQ ID 8540 and in the present specification as SEQ ID 
NOs: 38 and 39. A particularly preferred GBS 322 polypeptide lacks the N-terminal signal peptide, 
amino acid residues 1-24. An example of a preferred GBS 322 polypeptide is a 407 amino acid 
fragment and is shown in SEQ ID NO: 40. Examples of preferred GBS 322 polypeptides are further 
5 described in PCTUS04/ attorney docket number PP20665.002 filed September 15, 2004, 

hereby incorporated by reference, published as WO 2005/002619. 

Additional GBS proteins which may be combined with the GBS AI surface proteins of the 
invention are also described in WO 2005/002619. These GBS proteins include GBS 91, GBS 184, 
GBS 305, GBS 330, GBS 338, GBS 361, GBS 404, GBS 690, and GBS 691. 

10 Additional GBS proteins which may be combined with the GBS AI surface proteins of the 

invention are described in WO 02/3477 1 . 

GBS polysaccharides which may be combined with the GBS AI surface proteins of the 
invention are described in WO 2004/041 157. For example, the GBS AI surface proteins of the 
invention may be combined with a GBS polysaccharides selected from the group consisting of 

15 serotype la, lb, Ia/c, II, III, IV, V, VI, VII and VIII. 

The oligomeric, pilus-like structures may be isolated or purified from bacterial cultures in 
which the bacteria express an AI surface protein. The invention therefore includes a method for 
manufacturing an oligomeric AI surface antigen comprising culturing a GBS bacterium that expresses 
the oligomeric AI protein and isolating the expressed oligomeric AI protein from the GBS bacteria. 

20 The AI protein may be collected from secretions into the supernatant or it may be purified from the 
bacterial surface. The method may further comprise purification of the expressed AI protein. 
Preferably, the AI protein is in a hyperoligomeric form. Macromolecular structures associated with 
oligomeric pili are observed in the supernatant of cultured GBS strain Cohl. (See Figure 46.) These 
pili are found in the supernatant at all growth phases of the cultured Cohl bacteria. (See Figure 47.) 

25 The oligomeric, pilus-like structures may be isolated or purified from bacterial cultures 

overexpressing an AI surface protein. The invention therefore includes a method for manufacturing 
an oligomeric Adhesin Island surface antigen comprising culturing a GBS bacterium adapted for 
increased AI protein expression and isolation of the expressed oligomeric Adhesin Island protein from 
the GBS bacteria. The AI protein may be collected from secretions into the supernatant or it may be 

30 purified from the bacterial surface. The method may further comprise purification of the expressed 
Adhesin Island protein. Preferably, the Adhesin Island protein is in a hyperoligomeric form. 

The GBS bacteria are preferably adapted to increase AI protein expression by at least two 
(e.g., 2, 3, 4, 5, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200) times wild 
type expression levels. 

35 GBS bacteria may be adapted to increase AI protein expression by any means known in the 

art, including methods of increasing gene dosage and methods of gene upregulation. Such means 

include, for example, transformation of the GBS bacteria with a plasmid encoding the AI protein. The 

plasmid may include a strong promoter or it may include multiple copies of the sequence encoding the 
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>#&Kt^ the AI protein within the GBS bacterial genome may 

be deleted. Alternatively, or in addition, the promoter regulating the GBS Adhesin Island may be 
modified to increase expression. 

GBS bacteria harbouring a GBS AI-1 may also be adapted to increase AI protein expression 
5 by altering the number adenosine nucleotides present at two sites in the intergenic region between 
AraC and GBS 80. See Figure 197 A, which is a schematic showing the organization of GBS AI-1 
and Figure 197 B, which provides the sequence of the intergenic region between AraC and GBS 80 in 
the AL The adenosine tracts which applicants have identified as influencing GBS 80 surface 
expression are at nucleotide positions 187 and 233 of the sequence shown in Figure 197 B (SEQ ID 

10 NO: 273). Applicants determined the influence of these adenosine tracts on GBS 80 surface 

expression in strains of GBS bacteria harboring four adenosines at position 187 and six adenosines at 
position 233, five adenosines at position 187 and six adenosines position 233, and five adenosines at 
position 187 and seven adenosines at position 233. FACS analysis of these strains using anti GBS 80 
antiserum determined that an intergenic region with five adenosines at position 187 and six 

15 adenosines at position 233 had higher expression levels of GBS 80 on their surface than other stains. 
See Figure 197 C for results obtained from the FACS analysis. Therefore, manipulating the number 
of adenosines present at positions 187 and 233 of the AraC and GBS 80 intergenic region may further 
be used to adapt GBS to increase AI protein expression. 

The invention further includes GBS bacteria which have been adapted to produce increased 

20 levels of AI surface protein. In particular, the invention includes GBS bacteria which have been 
adapted to produce oligomeric or hyperoligomeric AI surface protein, such as GBS 80. In one 
embodiment, the Gram positive bacteria of the invention are inactivated or attenuated to permit in 
vivo delivery of the whole bacteria, with the AI surface protein exposed on its surface. 

The invention further includes GBS bacteria which have been adapted to have increased 

25 levels of expressed AI protein incorporated in pili on their surface. The GBS bacteria may be adapted 
to have increased exposure of oligomeric or hyperoligomeric AI proteins on its surface by increasing 
expression levels of a signal peptidase polypeptide. Increased levels of a local signal peptidase 
expression in Gram positive bacteria (such us LepA in GAS) are expected to result in increased 
exposure of pili proteins on the surface of Gram positive bacteria. Increased expression of a leader 

30 peptidase in GBS may be achieved by any means known in the art, such as increasing gene dosage 
and methods of gene upregulation. The GBS bacteria adapted to have increased levels of leader 
peptidase may additionally be adapted to express increased levels of at least one pili protein. 

Alternatively, the AI proteins of the invention may be expressed on the surface of a non- 
pathogenic Gram positive bacteria, such as Streptococus gordonii (See, e.g., Byrd et aL, "Biological 

35 consequences of antigen and cytokine co-expression by recombinant Streptococcus gordonii vaccine 

vectors", Vaccine (2002) 20:2197-2205) or Lactococcus lactis (See, e.g., Mannam et aL, "Mucosal 

Vaccine Made from Live, Recombinant Lactococcus lactis Protects Mice against Pharangeal Infection 

with Streptococcus pyogenes" Infection and Immunity (2004) 72(6):3444-3450). As used herein, 
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ilSW^ iaflieril refer to Gram positive bacteria which are compatible with a 

human host subject and are not associated with human pathogenisis. Preferably, the non-pathogenic 
bacteria are modified to express the AI surface protein in oligomeric, or hyper-oligomeric form. 
Sequences encoding for an AI surface protein and, optionally, an AI sortase, may be integrated into 
5 the non-pathogenic Gram positive bacterial genome or inserted into a plasmid. The non-pathogenic 
Gram positive bacteria may be inactivated or attenuated to facilitate in vivo delivery of the whole 
bacteria, with the AI surface protein exposed on its surface. Alternatively, the AI surface protein may 
be isolated or purified from a bacterial culture of the non-pathogenic Gram positive bacteria. For 
example, the AI surface protein may be isolated from cell extracts or culture supernatants. 

10 Alternatively, the AI surface protein may be isolated or purified from the surface of the non- 
pathogenic Gram positive bacteria. 

The non-pathogenic Gram positive bacteria may be used to express any of the Gram positive 
bacterial Adhesin Island proteins described herein, including proteins from a GBS Adhesin Island, a 
GAS Adhesin Island, or a S pneumo Adhesin Island. The non-pathogenic Gram positive bacteria are 

1 5 transformed to express an Adhesin Island surface protein. Preferably, the non-pathogenic Gram 

positive bacteria also express at least one Adhesin Island sortase. The AI transformed non-pathogenic 
Gram positive bacteria of the invention may be used to prevent or treat infection with a pathogenic 
Gram positive bacteria, such as GBS, GAS or Streptococcus pneumoniae. The non-pathogenic Gram 
positive bacteria may express the Gram positive bacterial Adheshin Island proteins in oligomeric 

20 forms that further comprise adhesin island proteins encoded within the genome of the non-pathogenic 
Gram positive bacteria. 

Applicants modified L. lactis to demonstrate that it can express GBS AI polypeptides. L. 
lactis was transformed with a construct encoding GBS 80 under its own promoter and terminator 
sequences. The transformed L. lactis appeared to express GBS 80 as shown by Western blot analysis 

25 using anti-GBS 80 antiserum. See lanes 6 and 7 of the Western Blots provided in Figures 133A and 
133B (133 A and 133B are two different exposures of the same Western blot). See also Example 13. 

Applicants also transformed L. lactis with a construct encoding GBS AI-1 polypeptides GBS 
80, GBS 52, SAG0647, SAG0648, and GBS 104 under the GBS 80 promoter and terminator 
sequences. These L. lactis expressed high molecular weight structures that were immunoreactive with 

30 anti-GBS 80 in immunoblots. See Figure 134, lane 2, which shows detection of a GBS 80 monomer 
and higher molecular weight polymers in total transformed L. lactis extracts. Thus, it appeared that L. 
lactis is capable of expressing GBS 80 in oligomeric form. The high molecular weight polymers were 
not only detected in L. lactis extracts, but also in the culture supernatants. See Figure 135 at lane 4. 
See also Example 14. Thus, the GBS AI polypeptides in oligomeric form can be isolated and purified 

35 from either L. lactis cell extracts or culture supernatants. These oligomeric forms can, for instance, be 

isolated from cell extracts or culture supernatants by release by sonication. See Figure 136A and B. 

See also Figure 171, which shows purification of GBS pili from whole extracts of L. lactis expressing 

the GBS AI-1 following sonication and gel filtration on a Sephacryl HR 400 column. 
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P C "f u^ i ^^i!s^}rEilffeHie < Z , ! EBc35 iSSHSftned with the construct encoding GBS AI-1 polypeptides 
GBS 80, GBS 52, SAG0647, SAG0648, and GBS 104 under the GBS 80 promoter and terminator 
sequences expressed the GBS AI-1 polypeptides on its surface. FACS analysis of these transformed 
L. lactis detected cell surface expression of both GBS 80 and GBS 104. The surface expression levels 
5 of GBS 80 and GBS 104 on the transformed L. lactis were similar to the surface expression levels of 
GBS 80 and GBS 104 on GBS strains COHl and JM9130013, which naturally express GBS AI-1. 
See Figure 169 for FACS analysis data for L. lactis transformed with GBS AM and wildtype 
JM9130013 bacteria using anti-GBS 80 and GBS 104 antisera. Table 40 provides the results of FACS 
analysis of transformed L. lactis, COHl, and JM9130013 bacteria using anti-GBS 80 and anti-GBS 
10 104 antisera. The numbers provided represent the mean fluorescence value difference calculated for 
immune versus pre-immune sera obtained for each bacterial strain. 



Table 40: FACS analysis of Z. lactis and GBS bacteria strains expressing GBS AI-1 





Anti-GBS 80 antiserum 


Anti-GBS 104 antiserum 


GBS AI-1 transformed L. lactis 


298 


251 


GBS COHl 


305 


305 


GBS JM9130013 


461 


355 



Immunogold-electronmicroscopy performed with anti-GBS 80 primary antibodies detected the 
presence of pilus structures on the surface of the L. lactis bacteria expressing GBS AI-1, confirming 
15 the results of the FACS analysis. See Figure 168 B and C. Interestingly, this expression of GBS pili 
on the surface of the L. lactis induced L. lactis aggregation. See Figure 170. Thus, GBS AI 
polypeptides may also be isolated and purified from the surface of L. lactis. The ability of L. lactis to 
express GBS AI polypeptides on its surface also demonstrates that it may be useful as a host to deliver 
GBS AI antigens. 

20 In fact, immunization of mice with L. lactis transformed with GBS AI-1 was protective in a 

subsequent challenge with GBS. Female mice were immunized with L. lactis transformed with GBS 
AI-1. The immunized female mice were bred and their pups were challenged with a dose of GBS 
sufficient to kill 90% of non-immunized pups. Detailed protocols for intranasal and subcutaneous 
immunization of mice with transformed L. lactis can be found in Examples 18 and 19, respectively. 

25 Table 43 provides data showing that immunization of the female mice with L. lactis expressing GBS 
AI-1 (LL-AI 1) greatly increased survival rate of challenged pups relative to both a negative PBS 
control (PBS) and a negatived, lactis control (LL 10 E9, which is wild type L. lactis not transformed 
to express GBS AI-1). 



Table 43: Protection of Mice Immunized with L. lactis expressing GBS AI-1 



Immunization 
Route 


Antigen 


Alive/Treated 


Survival 

% 


Survival 
% Range 


p value 


Intraperitoneum 


Recombinant GBS 80 


16/18 


89 


80-100 


<0.001 


Subcutaneous 


LL-AI 1 10 E9 


40/49 


82 


70-90 


<0.001 


LL-AI 1 10E10 


50/60 


83 


60-100 


<0.001 


PBS 


4/30 


13 


0-30 




LL 10 E9 


3/57 


5 


0-20 




Intranasal 


LL-AI 1 10 E9 


22/60 


37 


0-100 


0.02 
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31/49 


63 


30-90 


O.001 


LL 10 E9 


2/27 


7 


0-20 





Table 51 provides further evidence that immunization of mice withZ. lactis transformed with 
GBS AI-1 is protective against GBS. 



Table 51: Further Protection of Mice Immunized with L. lactis expressing GBS AI-1 



Antigen 


Iiwnunizatrph , 


Al i\^/Tredte^ 


Survival % 








(Aral ^.CK3k>pqoi)^ 


Recombinant GBS 80 


IP 


48/50 


92 


Recombinant GBS 80 


SC 


21/30 


70 


L. lactis* ATI 1QT cfu 


sc 


6/66 


9 


LJacfis+ALl 10 7 cfu 


SC 


47/70 


73 


LJactfs+AZl 10 8 cfu 


SC 


116/153 


76 


LJact/s+ATl 10 9 cfu 


sc 


98/118 


83 


LJactis+AXl 10 10 cfu 


sc 


107/129 


83 


L. lactis 10 10 cfu 


sc 


4/83 


5 


PBS 


sc 


6/110 


5 






51/97 


52 


uiacrtfe i^ 1 cfu ; v ':; 






7 


PBS \-^/v 









Protection of immunized mice with L. lactis expressing the GBS AI-1 is at least partly due to 
a newly raised antibody response. Table 46 provides anti-GBS 80 antibody titers detected in serum of 
the mice immunized within lactis expressing the GBS AI-1 as described above. Mice immunized 
withZ. lactis expressing the GBS AI-1 have anti-GBS 80 antibody titres, which are not observed in 
mice immunized with L. lactis not transformed to express the GBS AI-1. Further, as expected from 
the survival data, mice subcutaneously immunized withZ. lactis transformed to express the GBS AI-1 
have significantly higher serum anti-GBS 80 antibody titers than mice intranasally immunized with L. 
lactis transformed to express the GBS AI-1. 



Table 46: Antibody Responses against GBS 80 in Serum of Mice Immunized with L. 

lactis Expressing GBS AI-1 



Antigen 


Ab Titre Obtained Following 


Subcutaneous 
Immunization 


Intranasal 
Immunization 


Intraperitoneal 
Immunization 


LL 10 E9 


0 


0 




LL-AI 1 10 E9 


14000 


50 




LL-AI 1 10E10 


25000 


406 




Recombinant GBS 80 






120000 



Anti-GBS 80 antibodies of the IgA isotype were specifically detected in various body fluids 

of the mice subcutaneously or intranasally immunized withZ. lactis expressing the GBS AI-L 

Table 47: Anti-GBS 80 IgA Antibodies Detected in Mouse Tissues Following 
Immunization with L. lactis Expressing GBS AI-1 
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Anti-GBS 80 IgA Antibodies Detected in 


Serum 


Vaginal Wash 


Nasal Wash 


LL 10 E9 




0 


0 


0 


LL-AI 1 


Subcutaneous 


0 


25 


20 


LL-AI 1 


Intranasal 


140 


0 


150 


GBS 80 


Intraperitoneal 


60 


0 





Furthermore, opsonophagocytosis assays also demonstrated that at least some of the 
antiserum produced against the L. lactis expressing GBS AI 1 is opsonic for GBS. See Figure 161. 

To obtain protection of against GBS across a greater number of strains and serotypes, it is 
possible to transform L. lactis with a recombinant GBS AI encoding both GBS AI-1 and AI-2, i.e., a 
5 hybrid GBS AI. By way of example, a hybrid GBS AI may be a GBS AI-1 with a replacement of the 
GBS 104 gene with a GBS 67 gene. A schematic of such a hybrid GBS AI is depicted in Figure 23 1 
A, A hybrid GBS AI may alternatively be a GBS AM with a replacement of the GBS 52 gene with a 
GBS 59 gene. See the schematic at Figure 23 1 B. Alternatively, a hybrid GBS AI may be a GBS AI- 
1 with a substitution of a GBS 59 polypeptide for the GBS 52 gene and a substitution of the GBS 104 

10 gene for genes encoding GBS 59 and the two GBS AI-2 sortases. Another example of a hybrid GBS 
AI is a GBS AI-1 with the substitution of a GBS 59 gene for the GBS 52 gene and a GBS 67 for the 
GBS 104 gene. See the schematic at Figure 232. A further example of a hybrid GBS AI is a GBS AI- 
1 having a GBS 59 gene and genes encoding the GBS AI-2 sortases in place of the GBS 52 gene. Yet 
another example of a hybrid GBS AI is a GBS AI-1 with a substitution of either GBS 52 or GBS 104 

15 with a fusion protein comprising GBS 322 and one of GBS 59, GBS 67, or GBS 150. Some of these 
hybrid GBS AIs may be prepared as briefly outlined in Figure 234 A-F. 

Applicants have prepared a hybrid GBS AI having a GBS AI-1 sequence with a substitution 
of a GBS 67 coding sequence for the GBS 104 gene as depicted in Figure 231 A. Transformation of 
L. lactis with the hybrid GBS AI-1 resulted in L, lactis expression of high molecular weight polymers 

20 containing the GBS 80 and GBS 67 proteins. See Figure 233 A, which provides Western blot analysis 
of L. lactis transformed with the hybrid GBS AI depicted in Figure 23 1 A. When L. lactis 
transformed with the hybrid GBS AI were probed with antibodies to GBS 80 or GBS 67, high 
molecular weight structures were detected. See lanes labelled LL 4- a) in both the oc-80 and oc-67 
irnmunoblots. The GBS 80 and GBS 67 proteins were confirmed to be present on the surface of L. 

25 lactis by FACS analysis. See Figure 233 B, which shows a shift in fluorescence when GBS 80 and 
GBS 67 antibodies are used to detect GBS 80 and GBS 67 surface expression. The same shifts in 
fluorescence were not observed in L. lactis control cells, cells not transformed with the hybrid GBS 
AI. 

Alternatively, the oligomeric, pilus-like structures may be produced recombinantly. If 
30 produced in a recombinant host cell system, the AI surface protein will preferably be expressed in 

coordination with the expression of one or more of the AI sortases of the invention. Such AI sortases 
will facilitate oligomeric or hyperoligomeric formation of the AI surface protein subunits. 
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ii J ' E-'SolAS have a signal peptide sequence within the first 70 

amino acid residues. They may also include a transmembrane sequence within 50 amino acid 
residues of the C terminus. The sortases may also include at least one basic amino acid residue within 
the last 8 amino acids. Preferably, the sortases have one or more active site residues, such as a 
catalytic cysteine and histidine. 

As shown in Figure 1, AI-1 includes the surface exposed proteins of GBS 80, GBS 52 and 
GBS 104 and the sortases SAG0647 and SAG0648. AI-1 typically appears as an insertion into the 3' 
end of the trmA gene. 

In addition to the open reading frames encoding the AI-1 proteins, AI-1 may also include a 
divergently transcribed transcriptional regulator such as araC {i.e., the transcriptional regulator is 
located near or adjacent to the AI protein open reading frames, but it transcribed in the opposite 
direction). It is believed that araC may regulate the expression of the AI operon. {See Korbel et aL, 
Nature Biotechnology (2004) 22(7): 91 1 - 917 for a discussion of divergently transcribed regulators 
in E. coli). 

AI-1 may also include a sequence encoding a rho independent transcriptional terminator (see 
hairpin structure in Figure 1). The presence of this structure within the adhesin island is thought to 
interrupt transcription after the GBS 80 open reading frame, leading to increased expression of this 
surface protein. 

A schematic identifying AI-1 within several GBS serotypes is depicted in Figure 2. AI-1 
sequences were identified in GBS serotype V, strain isolate 2603; GBS serotype III, strain isolate 
NEM3 16; GBS serotype II, strain isolate 18RS21; GBS serotype V, strain isolate CJB1 11; GBS 
serotype III, strain isolate COH1 and GBS serotype la, strain isolate A909. (Percentages shown are 
amino acid identity to the 2603 sequence). (An AI-1 was not identified in GBS serotype lb, strain 
isolate H36B or GBS serotype la, strain isolate 515). 

An alignment of AI-1 polynucleotide sequences from serotype V, strain isolates 2603 and 

CJB111; serotype II, strain isolate 18RS21; serotype III, strain isolates COH1 and NEM316; and 

serotype la, strain isolate A909 is presented in Figure 18. An alignment of amino acid sequences of 

AI-1 surface protein GBS 80 from serotype V, strain isolates 2603 and CJB11 1; serotype la, strain 

isolate A909; serotype III, strain isolates COH1 and NEM3 16 is presented in Figure 22. An 

alignment of amino acid sequences of AI-1 surface protein GBS 104 from serotype V, strain isolates 

2603 and CJB111; serotype III, strain isolates COH1 andNEM316; and serotype II, strain isolate 

18RS21 is presented in Figure 23. Preferred AI-1 polynucleotide and amino acid sequences are 

conserved among two or more GBS serotypes or strain isolates. 

As shown in this figure, the full length of surface protein GBS 80 is particularly conserved 

among GBS serotypes V (strain isolates 2603 and CJBIII), III (strain isolates NEM3 16 and COH1), 

and la (strain isolate A909). The GBS 80 surface protein is missing or fragmented in serotypes II 

(strain isolate 18RS21), lb (strain isolate H36B) and la (strain isolate 515). 

Polynucleotide and amino acid sequences for AraC are set forth in FIGURE 30. 
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A second adhesin island, "Adhesin Island 2" or "AI-2" or "GBS AI-2" has also been 
identified in numerous GBS serotypes. A schematic depicting the correlation between AI-1 and AI-2 
within the GBS serotype V, strain isolate 2603 is shown in Figure 3. (Homology percentages in 
5 Figure 3 represent amino acid identity of the AI-2 proteins to the AI-1 proteins). Alignments of AI-2 
polynucleotide sequences are presented in Figures 20 and 21 (Figure 20 includes sequences from 
serotype V, strain isolate 2603 and serotype III, strain isolate NEM3 16. Figure 21 includes sequences 
from serotype III, strain isolate COH1 and serotype la, strain isolate A909). An alignment of amino 
acid sequences of AI-2 surface protein GBS 067 from serotype V, strain isolates 2603 and CJB1 11; 
10 serotype la, strain isolate 515; serotype II, strain isolate 18RS21; serotype lb, strain isolate H36B; and 
serotype III, strain isolate NEM3 16 is presented in Figure 24. Preferred AI-2 polynucleotide and 
amino acid sequences are conserved among two or more GBS serotypes or strain isolates. 

AI-2 comprises a series of approximately five open reading frames encoding for a collection 
of amino acid sequences comprising surface proteins and sortases. Specifically, AI-2 includes open 

15 reading frames encoding for two or more (i.e., 2, 3, 4, 5 or more) of GBS 67, GBS 59, GBS 150, 

SAG1405, SAG1406, 01520, 01521, 01522, 01523, 01523, 01524 and 01525. In one embodiment, 
AI-2 includes open reading frames encoding for two or more of GBS 67, GBS 59, GBS 150, 
SAG1405, and SAG1406. Alternatively, AI-2 may include open reading frames encoding for two or 
more of 01520, 01521, 01522, 01523, 01523, 01524 and 01525. 

20 One or more of the surface proteins typically include an LPXTG motif (such as LPXTG (SEQ 

ID NO: 122)) or other sortase substrate motif. The GBS AI-2 sortase proteins are thought to be 
involved in the secretion and anchoring of the LPXTG containing surface proteins. GBS AI-2 may 
encode for at least one surface protein. Alternatively, AI-2 may encode for at least two surface 
proteins and at least one sortase. Preferably, GBS AI-2 encodes for at least three surface proteins and 

25 at least two sortases. One or more of the AI-2 surface proteins may include an LPXTG or other 
sortase substrate motif. 

One or more of the surface proteins may also typically include pilin motif. The pilin motif 
may be involved in pili formation. Cleavage of AI surface proteins by sortase between the threonine 
and glycine residue of an LPXTG motif yields a thioester-linked acyl intermediate of sortase. The 

30 first lysine residue in a pilin motif can serve as an amino group acceptor of the cleaved LPXTG motif 
and thereby provide a covalent linkage between AI subunits to form pili. For example, the pilin motif 
can make a nucleophilic attack on the acyl enzyme providing a covalent linkage between AI subunits 
to form pili and regenerate the sortase enzyme. Some examples of pilin motifs that may be present in 
the GBS AI-2 proteins include ((YPKN(X 8 )K; SEQ ID NO: 158), (PK(X 8 )K; SEQ ID NO: 159), 

35 (YPK(X 9 )K;SEQ ID NO: 160), (PKN(X 8 )K; SEQ ID NO: 161), or (PK(X 10 )K; SEQ ID NO: 162)). 

One or more of the surface protein may also include an E box motif. The E box motif 

contains a conserved glutamic acid residue that is believed to be necessary for pilus formation. Some 

examples of E box motifs may include the amino acid sequences YxLxETxAPxG (SEQ ID NO: 163), 
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YxiSJ^^ (SEQ ID NO: 165), or YxLxETxAPxGY 

(SEQIDNO: 152). 

As shown in Figure 3, GBS AI-2 may include the surface exposed proteins of GBS 67, GBS 
59 and GBS 150 and the sortases of SAG1406 and SAG1405. Alternatively, GBS AI-2 may include 
5 the proteins 01521, 01524 and 01525 and sortases 01520 and 01522. GBS 067 and 01524 are 
preferred AI-2 surface proteins. 

AI-2 may also include a divergently transcribed transcriptional regulator such as a RofA like 
protein (for example rogB). As in AI-1, rogB is thought to regulate the expression of the AI-2 operon. 

A schematic depiction of AI-2 within several GBS serotypes is depicted in Figure 4. 
10 (Percentages shown are amino acid identity to the 2603 sequence). While the AI-2 surface proteins 
GBS 59 and GBS 67 are more variable across GBS serotypes than the corresponding AI-1 surface 
proteins, AI-2 surface protein GBS 67 appears to be conserved in GBS serotypes where the AI-1 
surface proteins are disrupted or missing. 

For example, as discussed above and in Figure 2, the AI-1 GBS 80 surface protein is 
15 fragmented in GBS serotype II, strain isolate 18RS21. Within AI-2 for this same sequence, as shown 
in Figure 4, the GBS 67 surface protein has 99% amino acid sequence homology with the 
corresponding sequence in strain isolate 2603. Similarly, the AI-1 GBS 80 surface protein appears to 
be missing in GBS serotype lb, strain isolate H36B and GBS serotype la, strain isolate 515. Within 
AI-2 for these sequences, however, the GBS 67 surface protein has 97 — 99 % amino acid sequence 
20 homology with the corresponding sequence in strain isolate 2603. GBS 67 appears to have two allelic 
variants, which can be divided according to percent homology with strains 2603 and H36B. See 
figures 237-239. 

Unlike for GBS 67, amino acid sequence identity of GBS 59 is variable across different GBS 
strains. As shown in Figures 63 and 224, GBS 59 of GBS strain isolate 2603 shares 100% amino acid 

25 residue homology with GBS strain 18RS21, 62% amino acid sequence homology with GBS strain 
H36B, 48% amino acid residue homology with GBS strain 5 15 and GBS strain CJB1 1 1, and 47% 
amino acid residue homology with GBS strain NEM3 1 6. The amino acid sequence homologies of the 
different GBS strains suggest that there are two isoforms of GBS 59. The first isoform appears to 
include the GBS 59 protein of GBS strains CJB111, NEM316, and 515. The second isoform appears 

30 to include the GBS 59 protein of GBS strains 18RS21, 2603, and H36B. (See Figures 63 and 224.) 

As expected from the variability in GBS 59 isoforms, antibodies specific for the first GBS 59 
isoform detect the first but not the second GBS 59 isoform and antibodies specific for the second GBS 
59 isoform detect the second but not the first GBS 59 isoform. See Figure 226A, which shows FACS 
analysis of 28 GBS strains having a GBS 59 gene detected using PCR for GBS 59 surface expression. 

35 For each of the 28 GBS strains, FACS analysis was performed using either an antibody for GBS 59 
isoform 1 (a-cjbl 1 1) or GBS 59 isoform 2 (a -2603). Only one of the two antibodies detected GBS 
59 surface expression on each GBS strain. As a negative control, GBS strains in which a GBS 59 , 
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gKa'I not W!tisiy&^ Bve significant GBS 59 surface expression levels. Figure 

226B. 

Also, GBS 59 is opsonic only against GBS strains expressing a homologous GBS 59 protein. 
See Figure 225. 

5 In one embodiment, the immunogenic composition of the invention comprises a first and a 

second isoform of the GBS 59 protein to provide protection across a wide range of GBS serotypes that 
express polypeptides from a GBS AI-2. The first isoform may be the GBS 59 protein of GBS strain 
CJB1 1 1, NEM316, or 515. The second isoform may be the GBS 59 protein of GBS strain 18RS21, 
2603, or H36B. 

10 The gene encoding GBS 59 has been identified in a high number of GBS isolates; the GBS 59 

gene was detected in 31 of 40 GBS isolates tested (77.5%). The GBS 59 protein also appears to be 
present as part of a pilus in whole extracts derived from GBS strains. Figure 64 shows detection of 
high molecular weight GBS 59 polymers in whole extracts of GBS strains C JB 1 1 1 , 73 57B, COH3 1 , 
D1363C, 5408, 1999, 5364, 5518, and 515 using antiserum raised against GBS 59 of GBS strain 

15 CJB1 1 1. Figure 65 also shows detection of these high molecular weight GBS 59 polymers in whole 
extracts of GBS strains D136C, 515, and CJB1 1 1 with anti-GBS 59 antiserum. (See also Figure 220 
. A for detection of GBS 59 high molecular weight polymers in strain 515.) Figure 65 confirms the 
presence of different isoforms of GBS 59. Antisera raised against two different GBS 59 isoforms 
results in different patterns of immunoreactivity depending on the GBS strain origin of the whole 

20 extract. Figure 65 further shows detection of GBS 59 monomers in purified GBS 59 preparations. 

GBS 59 is also highly expressed on the surface of GBS strains. GBS 59 was detected on the 
surface of GBS strains CJB111, DK1, DK8, Davis, 515, 2986, 5551, 1 169, and 7357B by FACS 
analysis using mouse antiserum raised against GBS 59 of GBS CJB1 1 1. FACS analysis did not detect 
surface expression of GBS 59 in GBS strains SMU071, JM9130013, and COH1, which do not contain 

25 a GBS 59 gene. (See Figure 66.) Further confirmation that GBS 59 is expressed on the surface of 
GBS is detection of GBS 59 by immuno-electron microscopy on the surface of GBS strain 515 
bacteria. See Figure 215. 

GBS 67 and GBS 150 also appear to be included in high molecular weight structures, or pili. 
Figure 69 shows that anti-GBS 67 and anti-GBS 150 immunoreact with high molecular weight 

30 structures in whole GBS strain 515 extracts. (See also Figure 220 B and C.) It is also notable in 

Figure 69 that the anti-GBS 59 antisera, raised in a mouse following immunization with GBS 59 of 
GBS strain 2603, does not cross-hybridize with GBS 59 in GBS strain 515. GBS 59 of GBS stain 515 
is of a different isotype than GBS 59 of GBS stain 2603. See Figure 63, which illustrates that the 
homology of these two GBS 59 polypeptides is 48%, and Figure 65, which confirms that GBS 59 

35 antisera raised against GBS strain 2603 does not cross-hybridize with GBS 59 of GBS strain 515. 

Formation of pili containing GBS 150 does not appear to require GBS 67 expression. Figure 
70 provides Western blots showing that higher molecular weight structures in GBS strain 515 total 
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oEUtol^ anti-GBS 150 antiserum. In a GBS strain 515 lacking 

GBS 67 expression, anti-GBS 67 antiserum no longer immunoreacts with polypeptides in total 
extracts, while anti-GBS 150 antiserum is still able to cross-hybridze with high molecular weight 
structures. 

5 Likewise, formation of pili containing GBS 59 does not appear to require GBS 67 expression. 

As expected, FACS detects GBS 67 cell surface expression on wildtype GBS strain 515, but not GBS 
strain 515 cells knocked out for GBS 67. FACS analysis using anti-GBS 59 antisera, however, 
detects GBS 59 expression on both the wildtype GBS strain 515 cells and the GBS strain 515 cells 
knocked out for GBS 67. Thus, GBS 59 cell surface expression is detected on GBS stain 515 cells 
10 regardless of GBS 67 expression. 

GBS 67, while present in pili, appears to be localized around the surface of GBS strain 515 
cells. See the immuno-electron micrographs presented in Figure 216. GBS 67 binds to fibronectin. 
See Figure 217. 

Formation of pili encoded by GBS AI-2 does require expression of GBS 59. Deletion of GBS 

15 59 from strain 515 bacteria eliminates detection of high molecular weight structures by antibodies that 
bind to GBS 59 (Figure 221 A, lane 3), GBS 67 (Figure 221 B, lane 3), and GBS 150 (Figure 221 C, 
lane 3). By contrast, Western blot analysis of 515 bacteria with a deletion of the GBS 67 gene detects 
high molecular weight structures using GBS 59 (Figure 221 A, lane 2) and GBS 150 (Figure 221 C, 
lane 2) antisera. Similarly, Western blot analysis of 5 15 bacteria with a deletion of the GBS 150 gene 

20 detects high molecular weight structures using GBS 59 (Figure 221 A, lane 4) and GBS 67 (Figure 
221 B, lane 4). See also Figure 223, which provides Western blots of each of the 515 strains 
interrogated with antibodies for GBS 59, GBS 67, and GBS 150. FACS analysis of strain 515 
bacteria deleted for either GBS 59 or GBS 67 confirms these results. See Figure 222, which shows 
that only deletion of GBS 59 abolishes surface expression of both GBS 59 and GBS 67. 

25 Formation of pili encoded by GBS AI-2 also requires expression of both GBS adhesin island- 

2 encoded sortases. See Figure 218, which provides Western blot analysis of strain 515 bacteria 
lacking Srtl, Srt2, or both Srtl and Srt2. Only deletion of both Srtl and Srt2 abolishes pilus assembly 
as detected by antibodies that cross-hybridize with each of GBS 59, GBS 67 and GBS 150. The 
results of the Western blot analysis were verified by FACS, which provided similar results. See 

30 Figure 219. 

As shown in Figure 4, two of the GBS strain isolates (COH 1 and A909) do not appear to 

contain homologues to the surface proteins GBS 59 and GBS 67. For these two strains, the 

percentages shown in Figure 4 are amino acid identity to the COH1 protein). Notwithstanding the 

difference in the surface protein lengths for these two strains, AI-2 within these sequences still 

35 contains two sortase proteins and three LPXTG containing surface proteins, as well as a signal 

peptidase sequence leading into the first surface protein. One of the surface proteins in this variant of 

AI-2, spbl , has previously been identified as a potential adhesion protein. (See Adderson et al., 

Infection and Immunity (2003) 71(12):6857 - 6863). Alternatively, because of the lack of GBS 59 
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arfi J (®§<^ tiMMMMA may be a third type of AI (Adhesin Island-3, AI-3, or 

GBS AI-3). 

More than one AI surface protein may be present in the oligomeric, pilus-like structures of the 
invention. For example, GBS 59 and GBS 67 may be incorporated into an oligomeric structure. 
5 Alternatively, GBS 59 and GBS 150 may be incorporated into an oligomeric structure, or GBS 59, 
GBS 150 and GBS 67 may be incoiporated into an oligomeric structure. 

In another embodiment, the invention includes compositions comprising two or more AI 
surface proteins. The composition may include surface proteins from the same adhesin island. For 
example, the composition may include two or more GBS AI-2 surface proteins, such as GBS 59, GBS 
10 67 and GBS 150. The surface proteins may be isolated from Gram positve bacteria or they may be 
produced recombinantly. 

GAS Adhesin Islands 

As discussed above, Applicants have identified at least four different GAS Adhesin Islands. 
15 These adhesion islands are thought to encode surface proteins which are important in the bacteria's 

virulence, and Applicants have obtained the first electron micrographs revealing the presence of these 
adhesin island proteins in hyperoligomeric pilus structures on the surface of Group A Streptococcus. 

Group A Streptococcus is a human specific pathogen which causes a wide variety of diseases 
ranging from pharyngitis and impetigo through life threatening invasive disease and necrotizing 
20 fascilitis. In addition, post-streptococcal autoimmune responses are still a major cause of cardiac 
pathology in children. 

Group A Streptococcal infection of its human host can generally occur in three phases. The 
first phase involves attachment and/or invasion of the bacteria into host tissue and multiplication of 
the bacteria within the extracellular spaces. Generally this attachment phase begins in the throat or 

25 the skin. . The deeper the tissue level infected, the more severe the damage that can be caused. In the 
second stage of infection, the bacteria secretes a soluble toxin that diffuses into the surrounding tissue 
or even systemically through the vasculature. This toxin binds to susceptible host cell receptors and 
triggers innappropropriate immune responses by these host cells, resulting in pathology. Because the 
toxin can diffuse throughout the host, the necrosis directly caused by the GAS toxins may be 

30 physically located in sites distant from the bacterial infection. The final phase of GAS infection can 
occur long after the original bacteria have been cleared from the host system. At this stage, the host's 
previous immune response to the GAS bacteria due to cross reactivity between epitopes of a GAS 
surface protein, M, and host tissues, such as the heart. A general review of GAS infection can be 
found in Principles of Bacterial Pathogeneis, Groisman e&, Chapter 15 (2001). 

35 In order to prevent the pathogenic effects associated with the later stages of GAS infection, an 

effective vaccine against GAS will preferably facilitate host elimination of the bacteria during the 
initial attachment and invasion stage. 
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IP Lm IkotatW ©i@aiii|).'A llreptjo^oieui are historically classified according to the M surface 
protein described above. The M protein is surface exposed trypsin-sensitive protein generally 
comprising two polypeptide chains complexed in an alpha helical formation. The carboxyl terminus 
is anchored in the cytoplasmic membrane and is highly conserved among all group A streptococci. 
5 The amino terminus, which extend through the cell wall to the cell surface, is responsible for the 
antigenic variability observed among the 80 or more serotypes of M proteins. 

A second layer of classification is based on a variable, trypsin-resistant surface antigen, 
commonly referred to as the T-antigen. Decades of epidemiology based on M and T serological 
typing have been central to studies on the biological diversity and disease causing potential of Group 

10 A Streptococci. While the M-protein component and its inherent variability have been extensively 
characterized, even after five decades of study, there is still very little known about the structure and 
variability of T-antigens. Antisera to define T types is commercially available from several sources, 
including Sevapharma (http://www.sevapharma.cz/en). 

The gene coding for one form of T-antigen, T-type 6, from an M6 strain of GAS (D741) has 

15 been cloned and characterized and maps to an approximately 1 1 kb highly variable pathogenicity 
island. Schneewind et aL, J Bacteriol. (1990) 172(6):33 10 - 33 17. This island is known as the 
Fibronectin-binding, Collagen-binding T-antigen (FCT) region because it contains, in addition to the 
T6 coding gene (teed), members of a family of genes coding for Extra Cellular Matrix (ECM) binding 
proteins. Bessen et aL, Infection & Immunity (2002) 70(3): 1 159-1 167. Several of the protein 

20 products of this gene family have been shown to directly bind either fibronectin and/or collagen. See 
Hanski et aL, Infection & Immunity (1992) 60(12):51 19-5125; Talay et aL, Infection & Immunity 
(1992( 60(9):3 837-3 844; Jaffe et aL (1996) 21(2):373-384; Rocha et aL, Adv Exp Med Biol. (1997) 
418:737-739; Kreikemeyer et aL, J Biol Chem (2004) 279(16): 15850-15859; Podbielski et aL, MoL 
Microbiol. (1999) 3 1(4): 105 1-64; and Kreikemeyer et aL, Int. J. Med Microbiol (2004) 294(2-3): 177- 

25 88. In some cases direct evidence for a role of these proteins in adhesion and invasion has been 
obtained. 

Applicants raised antiserum against a recombinant product of the teed gene and used it to 
explore the expression of T6 in M6 strain 2724. In immunoblot of mutanolysin extracts of this strain, 
the antiserum recognized, in addition to a band corresponding to the predicted molecular mass of the 

30 product, very high molecular weight ladders ranging in mobility from about 100 kDa to beyond the 
resolution of the 3-8% gradient gels used. 

This pattern of high molecular weight products is similar to that observed in immunoblots of 
the protein components of the pili identified in Streptococcus agalactiae (described above) and 
previously in Corynebacterium diphtherial Electron microscropy of strain M6_2724 with antisera 

35 specific for the product of tee6 revealed abundant surface staining and long pilus like structures 

extending up to 700 nanometers from the bacterial surface, revealing that the T6 protein, one of the 

antigens recognized in the original Lancefiled serotyping system, is located within a GAS Adhesin 

Island (GAS AI-1) and forms long covalently linked pilus structures. 
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IP C Xp]p^^ different Group A Streptococcus Adhesin Islands. 

While these GAS AI sequences can be identified in numerous M types, Applicants have surprisingly 
discovered a correlation between the four main pilus subunits from the four different GAS AI types 
and specific T classifications. While other trypsin-resistant surface exposed proteins are likely also 
5 implicated in the T classification designations, the discovery of the role of the GAS adhesin islands 
(and the associated hyper-oligomeric pilus like structures) in T classification and GAS serotype 
variance has important implications for prevention and treatment of GAS infections. Applicants have 
identified protein components within each of the GAS adhesin islands which are associated with the 
pilus formation. These proteins are believed to be involved in the bacteria's initial adherence 

10 mechanisms. Immunological recognition of these proteins may allow the host immune response to 
slow or prevent the bacteria's transition into the more pathogenic later stages of infection. 

In addition, Applicants have discovered that the GBS pili structures appear to be implicated in 
the formation of biofilms (populations of bacteria growing on a surface, often enclosed in an 
exopolysaccharide matrix). Biofilms are generally associated with bacterial resistance, as antibiotic 

1 5 treatments and host immune response are frequently unable to erradicate all of the bacteria 

components of the biofilm. Direction of a host immune response against surface proteins exposed 
during the first steps of bacterial attachment (i.e., before complete biofilm formation) is preferable. 

The invention therefore provides for improved immunogenic compositions against GAS 
infection which may target GAS bacteria during their initial attachment efforts to the host epithelial 

20 cells and may provide protection against a wide range of GAS serotypes. The immunogenic 

compositions of the invention include GAS AI surface proteins which may be formulated in an 
oligomeric, or hyperoligomeric (pilus) form. The invention also includes combinations of GAS AI 
surface proteins. Combinations of GAS AI surface proteins may be selected from the same adhesin 
island or they may be selected from different GAS adhesin islands. 

25 While there is surprising variability in the number and sequence of the GAS AI components 

across isolates, GAS AI sequences may be generally characterized as Type 1, Type 2, Type 3, and 
Type 4, depending on the number and type of sortase sequence within the island and the percentage 
identity of other proteins within the island. Schematics of the GAS adhesin islands are set forth in 
FIGURE 51A and FIGURE 162.< In all strains identified so far, the adhesin island region is flanked 

30 by highly conserved open reading frames Ml_123 and Ml_136. Between three and five genes in 
each GAS adhesin island code for ECM binding adhesin proteins containing LPXTG motifs. 
GAS Adhesin Island 1 

As discussed above, Applicants have identified adhesin islands, "GAS Adhesin Island 1" or 

"GAS AI-1", within the genome Group A Streptococcus serotypes and isolates. GAS AI-1 comprises 

35 a series of approximately five open reading frames encoding for a collection of amino acid sequences 

comprising surface proteins and sortases ("GAS AI-1 proteins"). GAS AI-1 preferably comprises 

surface proteins, a srtB sortase, and a rofA divergently transcribed transcriptional regulator. GAS AI- 

1 surface proteins may include a Fibronectin binding protein, a collagen adhesion protein and a 
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f&til^ of these GAS AI-1 surface proteins includes an LPXTG 

sortase substrate motif, such as LPXTG (SEQ ID NO: 122) or LPXSG (SEQ ID NO: 134) 
(conservative replacement of threonine with serine). Specifically, GAS AI-1 includes open reading 
frames encoding for two or more (i.e., 2, 3, 4 or 5) of M6_Spy0157, M6_Spy0158, M6_Spy0159, 
5 M6_SpyO 1 60, M6_Spy0 161. 

Applicants have also identified open reading frames encoding fimbrial structural subunits in 
other GAS bacteria harbouring an AI- 1 . These open reading frames encode fimbrial structural 
subunits CDC SS 410jTimbrial, ISS3650_fimbrial, and DSM2071 Jfimbrial. A GAS AI-1 may 
comprise a polynucleotide encoding any one of CDC SS 410_fimbrial, IS S3 65 OJSmbrial, and 

10 DSM2071_fimbrial. 

As discussed above, the hyper-oligomeric pilus structure of GAS AI-1 appears to be 
responsible for the T- antigen type 6 classification, and GAS AI-1 corresponds to the FCT region 
previously identified for tee6. As in GAS AI-1, the tee6 FCT region includes open reading frames 
encoding for a collagen adhesion protein (cpa, capsular polysaccharide adhesion) and a fibronectin 

15 binding protein (prtFl). Immunoblots of tee6, a GAS AI-1 fimbrial structural subunit corresponding 
to M6_Spyl60, reveal high molecular weight structures indicative of the hyper-oligomeric pilus 
structures. Immunoblots with antiserum specific for Cpa also recognize a high molecular weight 
ladder structure, indicating Cpa involvement in the GAS AI-1 pilus structure or formation. In EM 
photos of GAS bacteria, Cpa antiserum reveals abundant staining on the surface of the bacteria and 

20 occasional gold particles extended from the surface of the bacteria. In contrast, immunoblots with 
antiserum specific for PrtFl recognize only a single molecular species with electrophoretic mobility 
corresponding to its predicted molecular mass, indicating that PrtFl may not be associated with the 
oligomeric pilus structure. A preferred immunogenic composition of the invention comprises a GAS 
AI-1 surface protein which may be formulated or purified in an oligomeric (pilis) form. In a preferred 

25 embodiment, the oligomeric form is a hyperoligomer. Another preferred immunogenic composition 
of the invention comprises a GAS AI-1 surface protein which has been isolated in an oligomeric 
(pilis) form. The oligomer or hyperoligomeric pilus structures comprising the GAS AI-1 surface 
proteins may be purified or otherwise formulate for use in immunogenic compositions. 

One or more of the GAS AI-1 open reading frame polynucleotide sequences may be replaced 

30 by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one or more 
of the GAS AI-1 open reading frames may be replaced by a sequence having sequence homology to 
the replaced ORF. 

One or more of the GAS AI-1 surface protein sequences typically include an LPXTG motif 

(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

35 The LPXTG sortase substrate motif of a GAS AI surface protein may be generally 

represented by the formula XXXXG, wherein X at amino acid position 1 is an L, a V, an E, or a Q, 

wherein X at amino acid position 2 is a P if X at amino acid position 1 is an L, wherein X at amino 

acid position 2 is a V if X at amino acid position 1 is a E or a Q, wherein X at amino acid position 2 is 
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a^'oV'a P it X^tiimm^Wbid^ V, wherein X at amino acid position 3 is any amino acid 

residue, wherein X at amino acid position 4 is a T if X at amino acid position 1 is a V, E, or Q, and 
wherein X at amino acid position 4 is a T, S, or A if X at amino acid position 1 is an L. Some 
examples of LPXTG motifs present in GAS AI surface proteins include LPSXG (SEQ ID NO: 134), 
5 VVXTG (SEQ ID NO: 135), EVXTG (SEQ ID NO: 136), VPXTG (SEQ ID NO: 137), QVXTG 
(SEQ ID NO: 138), LPXAG (SEQ ID NO: 139), QVPTG (SEQ ID NO: 140), and FPXTG (SEQ ID 
NO: 141), 

The GAS AI surface proteins of the invention may affect the ability of the GAS bacteria to 
adhere to and invade epithelial cells. AI surface proteins may also affect the ability of GAS to 

10 translocate through an epithelial cell layer. Preferably, one or more GAS AI surface proteins are 
capable of binding to or otherwise associating with an epithelial cell surface. GAS AI surface 
proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The GAS AI-1 sortase proteins are predicted to be involved in the secretion and anchoring of 
the LPXTG containing surface proteins. GAS AI-1 may encode for at least one surface protein. 

15 Alternatively, GAS AI-1 may encode for at least two surface exposed proteins and at least one 

sortase. Preferably, GAS AI-1 encodes for at least three surface exposed proteins and at least two 
sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 

20 protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

25 GAS AI-1 preferably includes a srtB sortase. GAS srtB sortases may preferably anchor 

surface proteins with an LPSTG motif (SEQ ID NO: 166), particularly where the motif is followed by 
a serine. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a GAS AI-1 surface protein such as M6_Spy0157, M6_Spy0159, M6_Spy0160, 

30 CDC SS 410_fimbrial, ISS3650_fimbrial, or DSM2071_fimbrial. The oligomeric, pilus-like structure 
may comprise numerous units of AI surface protein. Preferably, the oligomeric, pilus-like structures 
comprise two or more AI surface proteins. Still more preferably, the oligomeric, pilus-like structure 
comprises a hyper-oligomeric pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) 

35 oligomeric subunits, wherein each subunit comprises an AI surface protein or a fragment thereof. The 
oligomeric subunits may be covalently associated via a conserved lysine within a pilin motif. The 
• oligomeric subunits may be covalently associated via an LPXTG motif, preferably, via the threonine 
or serine amino acid residue, respectively. 
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structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a GAS Adhesin Island protein in oligomeric 
form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises a 
composition comprising one or more GAS Adhesin Island 1 ("GAS AI-1") proteins and one or more 
GAS Adhesin Island 2 ("GAS AI-2"), GAS Adhesin Island 3 ("GAS AI-3"), or GAS Adhesin Island 
4 ("GAS AI-4") proteins, wherein one or more of the GAS Adhesin Island proteins is in the form of 
an oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the GAS AI-1 proteins, GAS AM may also 
include a divergently transcribed transcriptional regulator such as RofA {i.e., the transcriptional 
regulator is located near or adjacent to the AI protein open reading frames, but it transcribed in the 
opposite direction). 
GAS Adhesin Island 2 

A second adhesin island, "GAS Adhesin Island 2" or "GAS AI-2" has also been identified in 
Group A Streptococcus serotypes and isolates. G AS AI-2 comprises a series of approximately eight 
open reading frames encoding for a collection of amino acid sequences comprising surface proteins 
and sortases ("GAS AI-2 proteins"). Specifically, GAS AI-2 includes open reading frames encoding 
for two or more {i.e., 2, 3, 4, 5, 6, 7, or 8) of GAS15, Spy0127, GAS 1 6, GAS 17, GAS 18, Spy0131, 
Spy0133, andGAS20. 

A preferred immunogenic composition of the invention comprises a GAS AI-2 surface 
protein which may be formulated or purified in an oligomeric (pilis) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. Another preferred immunogenic composition 
of the invention comprises a GAS AI-2 surface protein which has been isolated in an oligomeric 
(pilis) form. The oligomer or hyperoligomeric pilus structures comprising the GAS AI-2 surface 
proteins may be purified or otherwise formulate for use in immunogenic compositions. 

One or more of the GAS AI-2 open reading frame polynucleotide sequences may be replaced 
by a polynucleotide sequence coding for a fragment of the replaced GRP. Alternatively, one or more 
of the GAS AI-2 open reading frames may be replaced by a sequence having sequence homology to 
the replaced ORF. 

One or more of the GAS AI-2 surface protein sequences typically include an LPXTG motif 
(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. The AI surface proteins of the 
invention may affect the ability of the GAS bacteria to adhere to and invade epithelial cells. AI 
surface proteins may also affect the ability of GAS to translocate through an epithelial cell layer. 
Preferably, one or more AI surface proteins are capable of binding to or otherwise associating with an 
epithelial cell surface. AI surface proteins may also be able to bind to or associate with fibrinogen, 
fibronectin, or collagen. 
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il^'SAs^^%^ei§^ro^\ii§M predicted to be involved in the secretion and anchoring of 
the LPXTG containing surface proteins. GAS AI-2 may encode for at least one surface protein. 
Alternatively, GAS AI-2 may encode for at least two surface exposed proteins and at least one 
sortase. Preferably, GAS AI-2 encodes for at least three surface exposed proteins and at least two 
5 sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 

10 precursor such as lipid IL The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising an AI surface protein such as GAS 15, GAS 16, or GAS 18. The oligomeric, 

15 pilus-like structure may comprise numerous units of AI surface protein. Preferably, the oligomeric, 
pilus-like structures comprise two or more AI surface proteins. Still more preferably, the oligomeric, 
pilus-like structure comprises a hyper-oligomeric pilus-like structure comprising at least two (e.g., 2, 
3,4,5, 6, 7,8,9, 10, 11, 12, 13, 14, 15,20, 25,30,35,40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 
200 or more) oligomeric subunits, wherein each subunit comprises an AI surface protein or a 

20 fragment thereof. The oligomeric subunits may be covalently associated via a conserved lysine within 
a pilin motif. The oligomeric subunits may be covalently associated via an LPXTG motif, preferably, 
via the threonine amino acid residue. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

25 The oligomeric, pilus like structures may be used alone or in the combinations of the 

invention. In one embodiment, the invention comprises a GAS Adhesin Island protein in oligomeric 
form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises a 
composition comprising one or more GAS Adhesin Island 2 ("GAS AI-2") proteins and one or more 
GAS Adhesin Island 1 ("GAS AI-1"), GAS Adhesin Island 3 ("GAS AI-3"), or GAS Adhesin Island 

30 4 ("GAS AI-4") proteins, wherein one or more of the Adhesin Island proteins is in the form of an 
oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the GAS AI-2 proteins, GAS AI-2 may also 
include a divergently transcribed transcriptional regulator such as rofA (i.e., the transcriptional 
regulator is located near or adjacent to the AI protein open reading frames, but it transcribed in the 

35 opposite direction). 

GAS Adhesin Island 3 

A third adhesin island, "GAS Adhesin Island 3" or "GAS AI-3" has also been identified in 

several Group A Streptococcus serotypes and isolates. GAS AI-3 comprises a series of approximately 
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severroperf reading feme's eri'cddirig For a collection of amino acid sequences comprising surface 
proteins and sortases ("GAS AI-3 proteins"). Specifically, GAS AI-3 includes open reading frames 
encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of SpyM3_0098, SpyM3_0099, SpyJVBJ) 1 00, 
SpyM3_0101, SpyM3J)102, SpyM3_0103, SpyM3_0104, SPsOlOO, SPsOlOl, SPs0102, SPs0103, 
SPs0104, SPs0105, SPs0106, orf78, orf79, orf80, orfSl, orf82, orf83, orf84, spyM18J)126, 
spyM18_0127, spyM18J)128, spyM18_0129, spyM18J)130, spyM18J)131, spyM18_0132, 
SpyoM01000156, SpyoMOlOOOl 55, SpyoMO 1000 154, SpyoMO 1000 153, SpyoMO 1000 152, 
SpyoM01000151, SpyoMO 1000 150, and SpyoMO 1000 149. In one embodiment, GAS AI-3 includes 
open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of SpyM3_0098, SpyM3 J)099, 
SpyM3_0100, SpyM3_0101, SpyM3_0102, SpyM3J)103, and SpyM3_0104. In another 
embodiment, GAS AI-3 includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 
7) of SPsOlOO, SPsOlOl, SPs0102, SPs0103, SPsO104, SPs0105, and SPs0106. In a further 
embodiment, GAS AI-3 includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 
7) of orf78, orf79, orf80, orfSl, orf82, orf83, and orf84. In yet another embodiment, GAS AI-3 
includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of spyM18_0126, 
spyM18_0127, spyM18J)128, spyM18_0129, spyM18_0130, spyM18_0131, and spyM18J)132. In 
yet another embodiment, GAS AI-3 includes open reading frames encoding for two or more (i.e., 2, 3, 
4, 5, 6, or 7) of SpyoMO 1000 15 6, SpyoMO 1000 155, SpyoMO 1000 154, SpyoMO 1000 153, 
SpyoMOlOOOl 52, SpyoM01000151, SpyoMO 1000 150, and SpyoMO 1000 149. 

Applicants have also identified open reading frames encoding fimbrial structural subunits in 
other GAS bacteria harbouring an AI-3. These open reading frames encode fimbrial structural 
subunits ISS3040_fimbrial, ISS3776_fimbrial, and ISS4959jfimbriaL A GAS AI-3 may comprise a 
polynucleotide encoding any one of ISS3040_fimbrial, ISS3776_fimbrial, and ISS4959jfimbrial. 

One or more of the GAS AI-3 open reading frame polynucleotide sequences may be replaced 
by a polynucleotide sequence coding for a fragment of the replaced ORF. Alternatively, one or more 
of the GAS AI-3 open reading frames may be replaced by a sequence having sequence homology to 
the replaced ORF. 

A preferred immunogenic composition of the invention comprises a GAS AI-3 surface 
protein which may be formulated or purified in an oligomeric (pilis) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. Another preferred immunogenic composition 
of the invention comprises a GAS AI-3 surface protein which has been isolated in an oligomeric 
(pilis) form. The oligomer or hyperoligomeric pilus structures comprising the GAS AI-3 surface 
proteins may be purified or otherwise formulate for use in immunogenic compositions. 

One or more of the GAS AI-3 surface protein sequences typically include an LPXTG motif 

(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. The AI surface proteins of the 

invention may affect the ability of the GAS bacteria to adhere to and invade epithelial cells. AI 

surface proteins may also affect the ability of GAS to translocate through an epithelial cell layer. 

Preferably, one or more AI surface proteins are capable of binding to or otherwise associating with an 
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epithelial Cell Surface. ""Al surface proteins may also be able to bind to or associate with fibrinogen, 
fibronectin, or collagen. 

The GAS AI-3 sortase proteins are predicted to be involved in the secretion and anchoring of 
the LPXTG containing surface proteins. GAS AI-3 may encode for at least one surface protein. 
5 Alternatively, GAS AI-3 may encode for at least two surface exposed proteins and at least one 

sortase. Preferably, GAS AI-3 encodes for at least three surface exposed proteins and at least two 
sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
10 protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine or alanine carboxyl group and a 
cell wall precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via 
the transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

15 The invention includes a composition comprising oligomeric, pilus-like structures comprising 

an AI surface protein such as SpyM3_0098, SpyM3_0100, SpyM3_0102, SpyM3_0104, SPsOlOO, 
SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126, spyM18_0128, 
spyM 18J)130, spyM18_0132, SpyoMO 1000 155, SpyoMO 1000 153, SpyoMO 1000151, 
SpyoM01000149, ISS3040_frmbrial, ISS3776jBmbrial, and ISS4959_fimbrial. In one embodiment, 

20 the invention includes a composition comprising oligomeric, pilus-like structures comprising an AI 
surface protein such as SpyM3_0098, SpyM3_0100, SpyM3_0102, and SpyM3_0104. In another 
embodiment, the invention includes a composition comprising oligomeric, pilus-like structures 
comprising an AI surface protein such as SPsOlOO, SPs0102, SPs0104, and SPs0106. In another 
embodiment, the invention includes a composition comprising oligomeric, pilus-like structures 

25 comprising an AI surface protein such as orf78, orf80, orf82, and orf84. In yet another embodiment, 
the invention includes a composition comprising oligomeric, pilus-like structures comprising an AI 
surface protein such as spyM18_0126, spyM18_0128, spyM18_0130, and spyM18_0132. In a further 
embodiment, the invention includes a composition comprising oligomeric, pilus-like structures 
comprising an AI surface protein such as SpyoMO 1000 155, SpyoMO 1000 153, SpyoM01000151, and 

30 SpyoMO 1000 149. In yet a further embodiment, the invention includes a composition comprising 
oligomeric, pilus-like structures comprising an AI surface protein such as ISS3040_fimbrial, 
ISS3776_fimbrial, and ISS4959_fimbriaL The oligomeric, pilus-like structure may comprise 
numerous units of AI surface protein. Preferably, the oligomeric, pilus-like structures comprise two 
or more AI surface proteins. Still more preferably, the oligomeric, pilus-like structure comprises a 

35 hyper-oligomeric pilus-like structure comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 

14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric 

subunits, wherein each subunit comprises an AI surface protein or a fragment thereof. The oligomeric 

subunits may be covalently associated via a conserved lysine within a pilin motif. The oligomeric 
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sikBuMts*m'ay We'Sovalehuy associated' via an LPXTG motif, preferably, via the threonine amino acid 

residue. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a GAS Adhesin Island protein in oligomeric 
form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises a 
composition comprising one or more GAS Adhesin Island 3 ("GAS AI-3") proteins and one or more 
GAS Adhesin Island 1 ("GAS AI-1"), GAS Adhesin Island 2 ("GAS AI-2"), or GAS Adhesin Island 
4 ("GAS AI-4") proteins, wherein one or more of the Adhesin Island proteins is in the form of an 
oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the GAS AI-3 proteins, GAS AI-3 may also 
include a transcriptional regulator such as Nra, 
GAS Adhesin Island 4 

A fourth adhesin island, "GAS Adhesin Island 4" or "GAS AI-4" has also been identified in 
Group A Streptococcus serotypes and isolates. GAS AI-4 comprises a series of approximately eight 
open reading frames encoding for a collection of amino acid sequences comprising surface proteins 
and sortases ("GAS AI-4 proteins"). Specifically, GAS AI-4 includes open reading frames encoding 
for two or more (i.e., 2, 3, 4, 5, 6, 7, or 8) of 19224134, 19224135, 19223136, 19223137, 19224138, 
19224139, 19224140, and 19224141. 

Applicants have also identified open reading frames encoding fimbrial structural subunits in 
other GAS bacteria harbouring an AI-4. These open reading frames encode fimbrial structural 
subunits 20010296_fimbrial, 200200 6 9__fimbrial, CDC SS 635jBmbrial, ISS4883_fimbrial, and 
ISS4538_fimbrial. A GAS AI-4 may comprise a polynucleotide encoding any one of 
20010296 Jimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, and 
ISS4538_fimbrial. 

One or more of the GAS AI-4 open reading frame polynucleotide sequences may be replaced 
by a polynucleotide sequence coding for a fragment of the replaced ORR Alternatively, one or more 
of the GAS AI-4 open reading frames may be replaced by a sequence having sequence homology to 
the replaced ORF. 

A preferred immunogenic composition of the invention comprises a GAS AI-4 surface 
protein which may be formulated or purified in an oligomeric (pilis) form. In a preferred 
embodiment, the oligomeric form is a hyperoligomer. Another preferred immunogenic composition 
of the invention comprises a GAS AI-4 surface protein which has been isolated in an oligomeric 
(pilis) form. The oligomer or hyperoligomeric pilus structures comprising the GAS AI-4 surface 
proteins may be purified or otherwise formulate for use in immunogenic compositions. 

One or more of the GAS AI-4 surface protein sequences typically include an LPXTG motif 

(such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. The AI surface proteins of the 
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invention' ma}4S Jt'lh? ^ilty'of rhV^!&fe bacteria to adhere to and invade epithelial cells. AI 
surface proteins may also affect the ability of GAS to translocate through an epithelial cell layer. 
Preferably, one or more AI surface proteins are capable of binding to or otherwise associating with an 
epithelial cell surface. AI surface proteins may also be able to bind to or associate with fibrinogen, 
5 fibronectin, or collagen. 

The GAS AI-4 sortase proteins are predicted to be involved in the secretion and anchoring of 
the LPXTG containing surface proteins. GAS AI-4 may encode for at least one surface protein. 
Alternatively, GAS AI-4 may encode for at least two surface exposed proteins and at least one 
sortase. Preferably, GAS AI-4 encodes for at least three surface exposed proteins and at least two 
10 sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 

15 precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 27 10 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like. 
structures comprising an AI surface protein such as 19224134, 19224135, 19224137, 19224139, 

20 19224141, 20010296_fimbrial, 20020069_fimbrial, CDC SS 635_flmbrial, ISS4883_fimbrial, and 
ISS4538_fimbrial. The oligomeric, pilus-like structure may comprise numerous units of AI surface 
protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface proteins. 
Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric pilus-like 
structure comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 20, 25, 30, 35, 40, 

25 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each subunit 
comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be covalently 
associated via a conserved lysine within a pilin motif. The oligomeric subunits may be covalently 
associated via an LPXTG motif, preferably, via the threonine amino acid residue. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 

30 structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a GAS Adhesin Island protein in oligomeric 
form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises a 
composition comprising one or more GAS Adhesin Island 4 ("GAS AI-4") proteins and one or more 

35 GAS Adhesin Island 1 ("GAS AI-1"), GAS Adhesin Island 2 ("GAS AI-2"), or GAS Adhesin Island 
3 ("GAS AI-3") proteins, wherein one or more of the Adhesin Island proteins is in the form of an 
oligomer, preferably in a hyperoligomeric form. 
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f J! " KidMibMo 1 tBe"opin fe^iilg'^lmes encoding the GAS AI-4 proteins, GAS AI-4 may also 
include a divergently transcribed transcriptional regulator such as rofA (i.e., the transcriptional 
regulator is located near or adjacent to the AI protein open reading frames, but it transcribed in the 
opposite direction). 

5 The oligorneric, pilus-like structures of the invention may be combined with one or more 

additional GAS proteins. In one embodiment, the oligorneric, pilus-like structures comprise one or 
more AI surface proteins in combination with a second GAS protein. 

The oligorneric, pilus-like structures may be isolated or purified from bacterial cultures in 
which the bacteria express an AI surface protein. The invention therefore includes a method for 

10 manufacturing an oligorneric AI surface antigen comprising culturing a GAS bacterium that expresses 
the oligorneric AI protein and isolating the expressed oligorneric AI protein from the GAS bacteria. 
The AI protein may be collected from secretions into the supernatant or it may be purified from the 
bacterial surface. The method may further comprise purification of the expressed AI protein. 
Preferably, the AI protein is in a hyperoligomeric form. 

15 The oligorneric, pilus-like structures may be isolated or purified from bacterial cultures 

overexpressing an AI surface protein. The invention therefore includes a method for manufacturing 
an oligorneric Adhesin Island surface antigen comprising culturing a GAS bacterium adapted for 
increased AI protein expression and isolation of the expressed oligorneric Adhesin Island protein from 
the GAS bacteria. The AI protein may be collected from secretions into the supernatant or it may be 

20 purified from the bacterial surface. The method may further comprise purification of the expressed 
Adhesin Island protein. Preferably, the Adhesin Island protein is in a hyperoligomeric form. 

The GAS bacteria are preferably adapted to increase AI protein expression by at least two 
(e.g., 2, 3, 4, 5, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200) times wild 
type expression levels. 

25 GAS bacteria may be adapted to increase AI protein expression by any means known in the 

art, including methods of increasing gene dosage and methods of gene upregulation. Such means 
include, for example, transformation of the GAS bacteria with aplasmid encoding the AI protein. 
The plasmid may include a strong promoter or it may include multiple copies of the sequence 
encoding the AI protein. Optionally, the sequence encoding the AI protein within the GAS bacterial 

30 genome may be deleted. Alternatively, or in addition, the promoter regulating the GAS Adhesin 
Island may be modified to increase expression. 

The invention further includes GAS bacteria which have been adapted to produce increased 
levels of AI surface protein. In particular, the invention includes GAS bacteria which have been 
adapted to produce oligorneric or hyperoligomeric AI surface protein. In one embodiment, the Gram 

35 positive bacteria of the invention are inactivated or attenuated to permit in vivo delivery of the whole . 

bacteria, with the AI surface protein exposed on its surface. 

The invention further includes GAS bacteria which have been adapted to have increased 

levels of expressed AI protein incorporated in pili on their surface. The GAS bacteria may be adapted 
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tl'nlve SdfeIsla :: ^C : S^e 'o f fiy§m'Jic ; fer hyperoligomeric AI proteins on its surface by increasing 
expression levels of LepA polypeptide, or an equivalent signal peptidase, in the GAS bacteria. 
Applicants have shown that deletion of LepA in strain SF370 bacteria, which harbour a GAS AI-2, 
abolishes surface exposure of M and pili proteins on the GAS. Increased levels of LepA expression in 
GAS are expected to result in increased exposure of M and pili proteins on the surface of GAS. 
Increased expression of LepA in GAS may be achieved by any means known in the art, such as 
increasing gene dosage and methods of gene upregulation. The GAS bacteria adapted to have 
increased levels of LepA expression may additionally be adapted to express increased levels of at 
least one pili protein. 

Alternatively, the AI proteins of the invention may be expressed on the surface of a non- 
pathogenic Gram positive bacteria, such as Streptococus gordonii (See, e.g., Byrd et al., "Biological 
consequences of antigen and cytokine co-expression by recombinant Streptococcus gordonii vaccine 
vectors", Vaccine (2002) 20:2197-2205) or Lactococcus lactis (See, e.g., Mann am et al., "Mucosal 
Vaccine Made from Live, Recombinant Lactococcus lactis Protects Mice against Pharangeal Infection 
with Streptococcus pyogenes" Infection and Immunity (2004) 72(6): 3 444-3 450). As used herein, 
non-pathogenic Gram positive bacteria refer to Gram positive bacteria which are compatible with a 
human host subject and are not associated with human pathogenisis. Preferably, the non-pathogenic 
bacteria are modified to express the AI surface protein in oligomeric, or hyper-oligomeric form. 
Sequences encoding for an AI surface protein and, optionally, an AI sortase, may be integrated into 
the non-pathogenic Gram positive bacterial genome or inserted into a plasmid. The non-pathogenic 
Gram positive bacteria may be inactivated or attenuated to facilitate in vivo delivery of the whole 
bacteria, with the AI surface protein exposed on its surface. Alternatively, the AI surface protein may 
be isolated or purified from a bacterial culture of the non-pathogenic Gram positive bacteria. For 
example, the AI surface protein may be isolated from cell extracts or culture supernatants. 
Alternatively, the AI surface protein may be isolated or purified from the surface of the non- 
pathogenic Gram positive bacteria. 

The non-pathogenic Gram positive bacteria may be used to express any of the GAS Adhesin 
Island proteins described herein. The non-pathogenic Gram positive bacteria are transformed to 
express an Adhesin Island surface protein. Preferably, the non-pathogenic Gram positive bacteria also 
express at least one Adhesin Island sortase. The AI transformed non-pathogenic Gram positive 
bacteria of the invention may be used to prevent or treat infection with pathogenic GAS. 

Applicants modified L, lactis to demonstrate that, like GBS polypeptides, it can express GAS 

AI polypeptides. L, lactis was transformed with pAM401 constructs encoding entire pili gene clusters 

of AI-1, AI-2, and AI-4 adhesin islands. Briefly, the pAM401 is a promoterless high-copy plasmid. 

The entire pili gene clusters of anM6 (AI-1), Ml (AI-2), and M12 (AI-4) bacteria were inserted into 

the pAM401 construct. The gene clusters were transcribed under the control their own (M6, Ml , or 

Ml 2) promoter or the GBS promoter that successfully initiated expression of the GBS AI-1 adhesin 

islands inZ,. lactis, described above. Figure 172 provides a schematic depiction of GAS M6 (AI-1), 
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3VJ?V&-^ iidhesii"isilnil and indicates the portions of the adhesin island sequences 

inserted in the pAM401 construct. 

Each of the L. lactis transformed with one of the M6, Ml , or Ml 2 adhesin island gene 
clusters expressed high molecular weight structures that were immunoreactive with antibodies that 
bind to polypeptides present in their respective pili. Figures 173 A-C provide results of Western blot 
analysis of surface protein-enriched extracts of L. lactis transformed with M6 (Figure 173 A), Ml 
(Figure 173 B), or Ml 2 (Figure 173 C) adhesin island gene clusters using antibodies that bind to the 
fimbrial structural subunit encoded by each cluster. Figure 173 A at lanes 3 and 4 shows detection of 
high molecular structures in L. lactis transformed with an adhesin island pilus gene cluster from an 
Ml AI-2 using an antibody that binds to fimbrial structural subunit Spy0128. Figure 173B at lanes 3 
and 4 shows detection of high molecular weight structures in L. lactis transformed with an adhesin 
island pilus gene cluster from an M12 AI-4 using an antibody that binds to fimbrial structural subunit 
EftLSL.A. Figure 173C at lane 3 shows detection of high molecular weight structures inL. lactis 
transformed with an adhesin island pilus gene cluster from an M6 AI-1 using an antibody that binds to 
fimbrial structural subunit M6_Spy0160. In figures 173 A-C, "pi" immediately following the 
notation of AI subtype indicates that the promoter present in the Adhesin Island is used to drive 
transcription of the adhesin island gene cluster and "p2" indicates that the promoter was the GBS 
promoter described above. Thus, it appears that L. lactis is capable of expressing the fimbrial 
structural subunits encoded by GAS adhesin islands in an oligomeric form. 

Alternatively, the oligomeric, pilus-like structures may be produced recombinantly. If 
produced in a recombinant host cell system, the AI surface protein will preferably be expressed in 
coordination with the expression of one or more of the AI sortases of the invention. Such AI sortases 
will facilitate oligomeric or hyperoligomeric formation of the AI surface protein subunits. 
S. pneumoniae from TIGR4 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 
pneumoniae from TIGR4. The S. pneumoniae from TIGR4 Adhesin Island comprises a series of 
approximately seven open reading frames encoding for a collection of amino acid sequences 
comprising surface proteins and sortases. Specifically, the S. pneumoniae from TIGR4 AI proteins 
includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of SP0462, SP0463, 
SP0464, SP0465, SP0466, SP0467, and SP0468. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae from 
TIGR4 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. In a 
preferred embodiment, the oligomeric form is a hyperoligomer. Another preferred immunogenic 
composition of the invention comprises a S. pneumoniae from TIGR4 AI surface protein which has 
been isolated in an oligomeric (pilis) form. The oligomer or hyperoligomer pilus structures 
comprising S. pneumoniae surface proteins may be purified or otherwise formulated for use in 
immunogenic compositions. 
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'' lJi ^' Erie yiSW<S?tte from TIGR4 AI open reading frame polynucleotide 

sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. 
Alternatively, one or more of the S. pneumoniae from TIGR4 AI open reading frames may be 
replaced by a sequence having sequence homology to the replaced ORF, 

One or more of the S. pneumoniae from TIGR4 AI surface protein sequences typically 
include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

The S. pneumoniae from TIGR4 AI surface proteins of the invention may affect the ability of 
the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may also 
affect the ability of & pneumoniae to translocate through an epithelial cell layer. Preferably, one or 
more S. pneumoniae from TIGR4 AI surface proteins are capable of binding to or otherwise 
associating with an epithelial cell surface. S. pneumoniae from TIGR4 AI surface proteins may also 
be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae from TIGR4 AI sortase proteins are predicted to be involved in the 
secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae from TIGR4 AI 
may encode for at least one surface protein. Alternatively, S. pneumoniae from TIGR4 AI may 
encode for at least two surface exposed proteins and at least one sortase. Preferably, S. pneumoniae 
from TIGR4 AI encodes for at least three surface exposed proteins and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae from TIGR4 AI surface protein such as SP0462, SP0463, 
SP0464, or SP0465. The oligomeric, pilus-like structure may comprise numerous units of AI surface 
protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface proteins. 
Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric pilus-like 
structure comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 20, 25, 30, 35, 40, 
45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each subunit 
comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be covalently 
associated via a conserved lysine within a pilin motif. The oligomeric subunits may be covalently 
associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif 
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s The oligomenc, pilus like structures may be used alone or 111 the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae from TIGR4 AI protein in 
oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises 
a composition comprising one or more S. pneumoniae from TIGR4 AI proteins and one or more S. 
5 pneumoniae strain 670 AI proteins, wherein one or more of the S. pneumoniae AI proteins is in the 
form of an oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae from TIGR4 AI proteins, 
S. pneumoniae from TIGR4 AI may also include a transcriptional regulator. 
S. pneumoniae strain 670 Adhesin Island 
.10 As discussed above, Applicants have identified adhesin islands within the genome of S. 

pneumoniae strain 670. The S. pneumoniae strain 670 Adhesin Island comprises a series of 
approximately seven open reading frames encoding for a collection of amino acid sequences 
comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 670 AI proteins 
includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of orfl_670, or£3_670, 
1 5 orf4_670, orf5_670, orf6_670, orf7_670, orf8_670. 

A preferred immunogenic composition of the invention comprises a & pneumoniae strain 670 
AI surface protein which may be formulated or purified in an oligomeric (pilis) form. Another 
preferred immunogenic composition of the invention comprises a S. pneumoniae strain 670 AI surface 
protein which has been isolated in an oligomeric (pilis) form. 
20 One or more of the S, pneumoniae strain 670 AI open reading frame polynucleotide 

sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. 
Alternatively, one or more of the & pneumoniae strain 670 AI open reading frames may be replaced 
by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 670 AI surface protein sequences typically include 
25 an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

The S. pneumoniae strain 670 AI surface proteins of the invention may affect the ability of the 
S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may also affect 
the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one or more S, 
pneumoniae strain 670 AI surface proteins are capable of binding to or otherwise associating with an 
30 epithelial cell surface. S. pneumoniae strain 670 AI surface proteins may also be able to bind to or 
associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae strain 670 AI sortase proteins are predicted to be involved in the secretion 
and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 670 AI may encode 
for at least one surface protein. Alternatively, S. pneumoniae strain 670 AI may encode for at least 
35 two surface exposed proteins and at least one sortase. Preferably, S. pneumoniae strain 670 AI 
encodes for at least three surface exposed proteins and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
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pfotlifi, IkfeMyiiy I'etWe^n ffi^thiiMme : and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
5 Infection & Immunity (2004) 72(5): 27 1 0 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 670 AI surface protein such as orf3_670, orf4_670, or 
orf5_670. The oligomeric, pilus-like structure may comprise numerous units of AI surface protein. 
Preferably, the oligomeric, pilus-like structures comprise two or more AI surface proteins. Still more 
10 preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric pilus-like structure 

comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 
70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each subunit comprises an 
AI surface protein or a fragment thereof. The oligomeric subunits may be covalently associated via a 
conserved lysine within a pilin motif. The oligomeric subunits may be covalently associated via an 
15 LPXTG motif, preferably, via the threonine or serine amino acid residue, respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 670 AI protein in 
20 oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention comprises 
a composition comprising one or more S. pneumoniae strain 670 AI proteins and one or more S. 
pneumoniae from TIGR4 AI proteins, wherein one or more of the S. pneumoniae AI proteins is in the 
form of an oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 670 AI proteins, S. 
25 pneumoniae strain 670 AI may also include a transcriptional regulator. 
S. pneumoniae strain 14 CSR 10 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 
pneumoniae strain 14 CSR 10. The S. pneumoniae strain 14 CSR 10 Adhesin Island comprises a 
series of approximately seven open reading frames encoding for a collection of amino acid sequences 
30 comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 14 CSR 10 AI 
proteins includes open reading frames encoding for two or more {i.e., 2, 3, 4, 5, 6, or 7) of 
ORF2J4CSR, ORF3_14CSR, ORJF4_14CSR, ORF5_14CSR, ORF6_J4CSR, ORF7_14CSR, 
ORF8_14CSR. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 14 
35 CSR 10 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 

Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 14 
CSR 10 AI surface protein which has been isolated in an oligomeric (pilis) form. 
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$ Ji " f)rie W^Wcflpifte M'piiekm'SM^e strain 14 CSR 10 AI open reading frame polynucleotide 

sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. 
Alternatively, one or more of the S. pneumoniae strain 14 CSR 10 AI open reading frames may be 
replaced by a sequence having sequence homology to the replaced ORR 
5 One or more of the S, pneumoniae strain 14 CSR 10 AI surface protein sequences typically 

include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

The S. pneumoniae strain 14 CSR 10 AI surface proteins of the invention may affect the 
ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 
10 or more S. pneumoniae strain 14 CSR 10 AI surface proteins are capable of binding to or otherwise 
associating with an epithelial cell surface. S. pneumoniae strain 14 CSR 10 AI surface proteins may 
also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae strain 14 CSR 10 AI sortase proteins are predicted to be involved in the 
secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 14 CSR 10 
15 AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 14 CSR 10 AI 
may encode for at least two surface exposed proteins and at least one sortase. Preferably, S. 
pneumoniae strain 14 CSR 10 AI encodes for at least three surface exposed proteins and at least two 
sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 

20 associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 

protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 

25 Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 14 CSR 10 AI surface protein such as or£3_CSR 5 
orf4_CSR, or orf5_CSR. The oligomeric, pilus-like structure may comprise numerous units of AI 
surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 

30 proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 

pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 

35 covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 

structures of the invention will preferably include a pilin motif. 
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l|m ' 1L " The oll ! g^^ may be used alone or in the combinations of the 

invention. In one embodiment, the invention comprises a S. pneumoniae strain 14 CSR 10 AI protein 
in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 14 CSR 10 AI proteins, and 
5 one or more AI proteins of any of S. pneumoniae from TIGR4, 670, 19A Hungary 6, 6B Finland 12, 
6B Spain 2, 9V Spain 3, 19F Taiwan 14, 23F Taiwan 15, or 23F Poland 16, wherein one or more of 
the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 14 CSR 10AI 
proteins, S. pneumoniae strain 14 CSR 10 AI may also include a transcriptional regulator. 
10 S, pneumoniae strain 19A Hungary 6 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 
pneumoniae strain 19A Hungary 6. The & pneumoniae strain 19A Hungary 6 Adhesin Island 
comprises a series of approximately seven open reading frames encoding for a collection of amino 
acid sequences comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 19A 
15 Hungary 6 AI proteins includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) 
* of ORF2_19AH, ORF3 19AH, ORF4_19AH, ORF5_19AH, ORF6_19AH, ORF7_19AH, 
ORF8J9AH. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 19A 
Hungary 6 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
20 Another preferred immunogenic composition of the invention comprises a & pneumoniae strain 19A 
Hungary 6 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the S. pneumoniae strain 19A Hungary 6 AI open reading frame 
polynucleotide sequences may be replaced by a polynucleotide sequence coding for a fragment of the 
replaced ORF. Alternatively, one or more of the S. pneumoniae strain 19A Hungary 6 AI open 
25 reading frames may be replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 19A Hungary 6 AI surface protein sequences 
typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. 

The S. pneumoniae strain 19A Hungary 6 AI surface proteins of the invention may affect the 

30 ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 

also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 

or more S. pneumoniae strain 19A Hungary 6 AI surface proteins are capable of binding to or 

otherwise associating with an epithelial cell surface. 3. pneumoniae strain 19A Hungary 6 AI surface 

proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

35 The S. pneumoniae strain 19A Hungary 6 AI sortase proteins are predicted to be involved in 

the secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 19A 

Hungary 6 AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 19A 

Hungary 6 AI may encode for at least two surface exposed proteins and at least one sortase. 
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PilM^^ 6 AI encodes for at least three surface exposed 

proteins and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
5 protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid IL The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

10 In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 

structures comprising a S. pneumoniae strain 19A Hungary 6 AI surface protein such as orf3_19AH, 
orf4_19AH, or orf5 19AH. The oligomeric, pilus-like structure may comprise numerous units of AI 
surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 
proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 

15 pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 

20 respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like s tinctures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 19A Hungaiy 6 AI 
25 protein in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 19A Hungary 6 AI proteins 
and one or more AI proteins from one of any one of S. pneumoniae from TIGR4, 670, 14 CSR 10, 6B 
Finland 12, 6B Spain 2, 9V Spain 3, 19F Taiwan 14, 23F Taiwan 15, or 23F Poland 16 AI GR4 AI 
proteins, wherein one or more of the S. pneumoniae AI proteins is in the form of an oligomer, 
30 preferably in a hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 19A Hungary 6 AI 
proteins, S. pneumoniae strain 19A Hungary 6 AI may also include a transcriptional regulator. 
S. pneumoniae strain 19F Taiwan 14 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 

35 pneumoniae strain 19F Taiwan 14. The S. pneumoniae strain 19F Taiwan 14 Adhesin Island 

comprises a series of approximately seven open reading frames encoding for a collection of amino 

acid sequences comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 19F 

Taiwan 14 AI proteins includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) 
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o|aM>#Jw|j tMfeA Sfli^pibilPP 1 9FT W, ORF5_19FTW, ORF6_19FTW, ORF7_19FTW, 
ORF8_19FTW. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 19F 
Taiwan 14 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
5 Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 19F 
Taiwan 14 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the S. pneumoniae strain 19F Taiwan 14 AI open reading frame 
polynucleotide sequences may be replaced by a polynucleotide sequence coding for a fragment of the 
replaced ORP. Alternatively, one or more of the S. pneumoniae strain 19F Taiwan 14 AI open 
10 reading frames may be replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 19F Taiwan 14 AI surface protein sequences 
typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. 

The S. pneumoniae strain 19F Taiwan 14 AI surface proteins of the invention may affect the 

15 ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 
or more S. pneumoniae strain 19F Taiwan 14 AI surface proteins are capable of binding to or 
otherwise associating with an epithelial cell surface. S. pneumoniae strain 19F Taiwan 14 AI surface 
proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

20 The S. pneumoniae strain 19F Taiwan 14 AI sortase proteins are predicted to be involved in 

the secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 19F 
Taiwan 14 AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 19F 
Taiwan 14 AI may encode for at least two surface exposed proteins and at least one sortase. 
Preferably, S. pneumoniae strain 19F Taiwan 14 AI encodes for at least three surface exposed proteins 

25 and at least two sortases. > 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 

30 precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 19F Taiwan 14 AI surface protein such as or£3_19FTW, 

35 orf4_19FTW, or orf5_19FTW. The oligomeric, pilus-like structure may comprise numerous units of 

AI surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 

proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 

pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
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Jlff^T&^dfl^Q^t^^^^^ 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
5 respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 19F Taiwan 14 AI 

10 protein in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 19F Taiwan 14 AI proteins 
and one or more AI proteins of any one or more of S. pneumoniae from TIGR4, 670, 19A Hungary 6, 
6B Finland 12, 6B Spain 2, 9V Spain 3,14 CSR 10, 23F Taiwan 15, or 23 F Poland 16, wherein one or 
more of the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a hyperoligomeric 

15 form. 

In addition to the open reading frames encoding the S. pneumoniae strain 19F Taiwan 14 AI 
proteins, S. pneumoniae strain 19F Taiwan 14 AI may also include a transcriptional regulator. 
£ pneumoniae strain 23F Poland 16 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 

20 pneumoniae strain 23F Poland 16. The S. pneumoniae strain 23F Poland 16 Adhesin Island comprises 
a series of approximately seven open reading frames encoding for a collection of amino acid 
sequences comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 23F 
Poland 16 AI proteins includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) 
of ORF2_23FP, ORF3J23FP, ORF4_23FP, ORF5_23FP, ORF6_23FP, ORF7J23FP, and 

25 ORF8_23FP. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 23F 
Poland 1 6 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 23F 
Poland 16 AI surface protein which has been isolated in an oligomeric (pilis) form. 

30 One or more of the S. pneumoniae strain 23F Poland 16 AI open reading frame 

polynucleotide sequences may be replaced by a polynucleotide sequence coding for a fragment of the 
replaced ORF. Alternatively, one or more of the S. pneumoniae strain 23F Poland 16 AI open reading 
frames may be replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 23F Poland 16 AI surface protein sequences 

35 typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. 

The S. pneumoniae strain 23F Poland 16 AI surface proteins of the invention may affect the 

ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
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^|lff§cj:^ybJffili^ Eisfp2sZmMZ l B translocate through an epithelial cell layer. Preferably, one 
or more S. pneumoniae strain 23F Poland 16 AI surface proteins are capable of binding to or 
otherwise associating with an epithelial cell surface. S. pneumoniae strain 23 F Poland 16 AI surface 
proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 
5 The S. pneumoniae strain 23F Poland 16 AI sortase proteins are predicted to be involved in 

the secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 23F 
Poland 16 AI may encode for at least one surface protein. Alternatively, & pneumoniae strain 23F 
Poland 1 6 AI may encode for at least two surface exposed proteins and at least one sortase. 
Preferably, S. pneumoniae strain 23F Poland 16 Al encodes for at least three surface exposed proteins 

10 and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 

1 5 precursor such as lipid II The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 23F Poland 16 AI surface protein such as orf3_23FP, 

20 orf4_23FP, or orf5_23FP. The oligomeric, pilus-like structure may comprise numerous units of AI 
surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 
proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 
pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 

25 subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 

covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 

30 structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a & pneumoniae strain 23F Poland 16 AI 
protein in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 23F Poland 16 AI proteins and 

35 one or more AI proteins from any one or more S. pneumoniae strains of TIGR4, 670, 19A Hungary 6, 
6B Finland 12, 6B Spain 2, 9V Spain 3, 19F Taiwan 14, 23F Taiwan 15, or 14 CSR 10, wherein one 
or more of the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a 
hyperoligomeric form. 
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P C TEEl^MjIdHt^ encoding the S. pneumoniae strain 23F Poland 16 AI 

proteins, S. pneumoniae strain 23F Poland 16 AI may also include a transcriptional regulator. 
S. pneumoniae strain 23F Taiwan 15 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome ofS. 
pneumoniae strain 23F Taiwan 15. The S. pneumoniae strain 23F Taiwan 15 Adhesin Island 
comprises a series of approximately seven open reading frames encoding for a collection of amino 
acid sequences comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 23F 
Taiwan 15 AI proteins includes open reading frames encoding for two or more (z.e, 2, 3, 4, 5, 6, or 7) 
of ORF2_23FTW, ORF3_23FTW, ORF4_23FTW, ORF5_23FTW, ORF6_23FTW, ORF7_23FTW, 
ORF8_23FTW. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 23F 
Taiwan 15 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 23F 
Taiwan 15 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the S. pneumoniae strain 23F Taiwan 15 AI open reading frame 
polynucleotide sequences may be replaced by a polynucleotide sequence coding for a fragment of the 
replaced ORF. Alternatively, one or more of the S. pneumoniae strain 23F Taiwan 15 AI open 
reading frames may be replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 23F Taiwan 15 AI surface protein sequences 
typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. 

The S. pneumoniae strain 23F Taiwan 15 AI surface proteins of the invention may affect the 
ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 
or more S. pneumoniae strain 23F Taiwan 15 AI surface proteins are capable of binding to or 
otherwise associating with an epithelial cell surface. S. pneumoniae strain 23 F Taiwan 15 AI surface 
proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae strain 23F Taiwan 15 AI sortase proteins are predicted to be involved in 
the secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 23F 
Taiwan 15 AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 23 F 
Taiwan 15 AI may encode for at least two surface exposed proteins and at least one sortase. 
Preferably, S. pneumoniae strain 23F Taiwan 15 AI encodes for at least three surface exposed proteins 
and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 

precursor such as lipid IL The precursor can then be incorporated into the peptidoglycan via the 
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tipn@}$QS^ of bacterial wall synthesis. See Comfort et al., 

Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, .the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 23F Taiwan 15 AI surface protein such as orf3_23FTW, 
orf4__23FTW, or orf5_23FTW. The oligomeric, pilus-like structure may comprise numerous units of 
AI surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 
proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 
pilus-like structure comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 23F Taiwan 15 AI 
protein in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S, pneumoniae strain 23F Taiwan 15 AI proteins 
and one or more AI proteins from any one or more of S. pneumoniae from TIGR4, 670, 19A Hungary 
6, 6B Finland 12, 6B Spain 2, 9V Spain 3, 19F Taiwan 14, 14 CSR 10, or 23F Poland 16 AI, wherein 
one or more of the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a 
hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 23F Taiwan 15 AI 
proteins, S. pneumoniae strain 23F Taiwan 15 AI may also include a transcriptional regulator. 
S. pneumoniae strain 6B Finland 12 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 
pneumoniae strain 6B Finland 12. The S. pneumoniae strain 6B Finland 12 Adhesin Island comprises 
a series of approximately seven open reading frames encoding for a collection of amino acid 
sequences comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 6B 
Finland 12 AI proteins includes open reading frames encoding for two or more {i.e., 2, 3, 4, 5, 6, or 7) 
of ORF2 6BF, ORF3J5BF, ORF4 6BF, ORF5_6BF, ORF6_6BF, ORF7_6BF, ORF8_6BF. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 6B 
Finland 12 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 6B 
Finland 12 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the S. pneumoniae strain 6B Finland 12 AI open reading frame polynucleotide 

sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORP. 
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^^tte^atSv^liyiL^^gi! 4IBit}^I:e , ' < o£!!t£^ , iS::!jpa5fe&3ifcowziare strain 6B Finland 12 AI open reading frames may be 
replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 6B Finland 12 AI surface protein sequences 
typically include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
5 motif. 

The S. pneumoniae strain 6B Finland 12 AI surface proteins of the invention may affect the 
ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 
or more S. pneumoniae strain 6B Finland 12 AI surface proteins are capable of binding to or 

10 otherwise associating with an epithelial cell surface. S. pneumoniae strain 6B Finland 12 AI surface 
proteins may also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae strain 6B Finland 12 AI sortase proteins are predicted to be involved in 
the secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 6B 
Finland 12 AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 6B 

15 Finland 12 AI may encode for at least two surface exposed proteins and at least one sortase. 

Preferably, S. pneumoniae strain 6B Finland 12 AI encodes for at least three surface exposed proteins 
and at least two sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 

20 protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

25 In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 

structures comprising a S. pneumoniae strain 6B Finland 12 AI surface protein such as orf3_6BF, 
orf4_6BF, or orf5_6BF. The oligomeric, pilus-like structure may comprise numerous units of AI 
surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 
proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 

30 pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 

35 respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 
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jp £• l^^^ifcSiOC&jKiSpi ffuejyips may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 6B Finland 12 AI 
protein in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 6B Finland 12 AI proteins and 
5 one or more AI proteins of any one or more of S. pneumoniae from TIGR4, 670, 19A Hungary 6, 6B 
Finland 12, 6B Spain 2, 9V Spain 3, 19F Taiwan 14, 23F Taiwan 15, or 23F Poland 16 AI, wherein 
one or more of the & pneumoniae AI proteins is in the form of an oligomer, preferably in a 
hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 6B Finland 12 AI 
10 proteins, S. pneumoniae strain 6B Finland 12 AI may also include a transcriptional regulator. 
S. pneumoniae strain 6B Spain 2 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 
pneumoniae strain 6B Spain 2. The & pneumoniae strain 6B Spain 2 Adhesin Island comprises a 
series of approximately seven open reading frames encoding for a collection of amino acid sequences 
15 comprising surface proteins and sortases. Specifically, the & pneumoniae strain 6B Spain 2 AI 
proteins includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of 
ORF2J5BSP, ORF3_6BSP, ORF4 6BSP, ORF5 6BSP, ORF66BSP, ORF7_6BSP, and 
ORF8_6BSP. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 6B 

20 Spain 2 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 

Another preferred immunogenic composition of the invention comprises a 8. pneumoniae strain 6B 

Spain 2 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the 5. pneumoniae strain 6B Spain 2 AI open reading frame polynucleotide 

sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. 

25 Alternatively, one or more of the S. pneumoniae strain 6B Spain 2 AI open reading frames may be 

replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 6B Spain 2 AI surface protein sequences typically 

include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

The S. pneumoniae strain 6B Spain 2 AI surface proteins of the invention may affect the 

30 ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 

also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 

or more S. pneumoniae strain 6B Spain 2 AI surface proteins are capable of binding to or otherwise 

associating with an epithelial cell surface. S. pneumoniae strain 6B Spain 2 AI surface proteins may 

also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

35 The S. pneumoniae strain 6B Spain 2 AI sortase proteins are predicted to be involved in the 

secretion and anchoring of the LPXTG containing surface proteins. iS*. pneumoniae strain 6B Spain 2 

AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 6B Spain 2 AI 

may encode for at least two surface exposed proteins and at least one sortase. Preferably, & 
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pnSuntbniae strMWSp4ih 2 AI encodes *ror at least three surface exposed proteins and at least two 

sortases. 

The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 
associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 
transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et aL, 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 6B Spain 2 AI surface protein such as orf3_6BSP, 
orf4_6BSP, or orf5_6BSP. The oligomeric, pilus-like structure may comprise numerous units of AI 
surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 
proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 
pilus-like structure comprising at least two {e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin motif. The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 
structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 6B Spain 2 AI protein 
in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
comprises a composition comprising one or more S. pneumoniae strain 6B Spain 2 AI proteins and 
one or more AI proteins of any one or more of S. pneumoniae from TIGR4, 670, 19A Hungary 6, 6B 
Finland 12, 14 CSR 10, 9V Spain 3, 19F Taiwan 14, 23F Taiwan 15, or 23F Poland 16 AI, wherein 
one or more of the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a 
hyperoligomeric form. 

In addition to the open reading frames encoding the S. pneumoniae strain 6B Spain 2 AI 
proteins, S. pneumoniae strain 6B Spain 2 AI may also include a transcriptional regulator. 
S. pneumoniae strain 9V Spain 3 Adhesin Island 

As discussed above, Applicants have identified adhesin islands within the genome of S. 

pneumoniae strain 9V Spain 3. The S. pneumoniae strain 9V Spain 3 Adhesin Island comprises a 

series of approximately seven open reading frames encoding for a collection of amino acid sequences 

comprising surface proteins and sortases. Specifically, the S. pneumoniae strain 9V Spain 3 AI 

proteins includes open reading frames encoding for two or more (i.e., 2, 3, 4, 5, 6, or 7) of 
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ORF2_9VSP, ORF3__9VSP, ORF4_9VSp;ORF5_?VSP, ORF6_9VSP, ORF7_9VSP, and 
OKF8_9VSP. 

A preferred immunogenic composition of the invention comprises a S. pneumoniae strain 9V 
Spain 3 AI surface protein which may be formulated or purified in an oligomeric (pilis) form. 
5 Another preferred immunogenic composition of the invention comprises a S. pneumoniae strain 9V 
Spain 3 AI surface protein which has been isolated in an oligomeric (pilis) form. 

One or more of the & pneumoniae strain 9 V Spain 3 AI open reading frame polynucleotide 
sequences may be replaced by a polynucleotide sequence coding for a fragment of the replaced ORF. 
Alternatively, one or more of the S. pneumoniae strain 9V Spain 3 AI open reading frames may be 
10 replaced by a sequence having sequence homology to the replaced ORF. 

One or more of the S. pneumoniae strain 9V Spain 3 AI surface protein sequences typically 
include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. 

The & pneumoniae strain 9V Spain 3 AI surface proteins of the invention may affect the 
ability of the S. pneumoniae bacteria to adhere to and invade epithelial cells. AI surface proteins may 
15 also affect the ability of S. pneumoniae to translocate through an epithelial cell layer. Preferably, one 
or more S. pneumoniae strain 9V Spain 3 AI surface proteins are capable of binding to or otherwise 
associating with an epithelial cell surface. S. pneumoniae strain 9V Spain 3 AI surface proteins may 
also be able to bind to or associate with fibrinogen, fibronectin, or collagen. 

The S. pneumoniae strain 9V Spain 3 AI sortase proteins are predicted to be involved in the 
20 secretion and anchoring of the LPXTG containing surface proteins. S. pneumoniae strain 9 V Spain 3 
AI may encode for at least one surface protein. Alternatively, S. pneumoniae strain 9V Spain 3 AI 
may encode for at least two surface exposed proteins and at least one sortase. Preferabfy, S. 
pneumoniae strain 9V Spain 3 AI encodes for at least three surface exposed proteins and at least two 
sortases. 

25 The AI surface proteins may be covalently attached to the bacterial cell wall by membrane- 

associated transpeptidases, such as an AI sortase. The sortase may function to cleave the surface 
protein, preferably between the threonine and glycine residues of an LPXTG motif. The sortase may 
then assist in the formation of an amide link between the threonine carboxyl group and a cell wall 
precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via the 

30 transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et al., 
Infection & Immunity (2004) 72(5): 2710 - 2722. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a S. pneumoniae strain 9V Spain 3 AI surface protein such as orf3_9VSP, 
orf4_9VSP, or orf5_9VSP. The oligomeric, pilus-like structure may comprise numerous units of AI 

35 surface protein. Preferably, the oligomeric, pilus-like structures comprise two or more AI surface 

proteins. Still more preferably, the oligomeric, pilus-like structure comprises a hyper-oligomeric 

pilus-like structure comprising at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 

35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 or more) oligomeric subunits, wherein each 
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subunit comprises an AI surface protein or a fragment thereof. The oligomeric subunits may be 
covalently associated via a conserved lysine within a pilin moti£ The oligomeric subunits may be 
covalently associated via an LPXTG motif, preferably, via the threonine or serine amino acid residue, 
respectively. 

5 AI surface proteins or fragments thereof to be incorporated into the oligomeric, pilus-like 

structures of the invention will preferably include a pilin motif. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a S. pneumoniae strain 9V Spain 3 AI protein 
in oligomeric form, preferably in a hyperoligomeric form. In one embodiment, the invention 
10 comprises a composition comprising one or more S. pneumoniae strain 9V Spain 3 AI proteins and 
one or more AI proteins from any one or more of S. pneumoniae from TIGR4, 670, 19A Hungary 6, 
6B Finland 12, 6B Spain 2, 14 CSR 10, 19F Taiwan 14, 23F Taiwan 15, or 23F Poland 16 AI, 
wherein one or more of the S. pneumoniae AI proteins is in the form of an oligomer, preferably in a 
hyperoligomeric form. 

15 In addition to the open reading frames encoding the S, pneumoniae strain 9V Spain 3 AI 

proteins, & pneumoniae strain 9V Spain 3 AI may also include a transcriptional regulator. 

The S. pneumoniae oligomeric, pilus-like structures may be isolated or purified from bacterial 
cultures in which the bacteria express an S. pneumoniae AI surface protein. The invention therefore 
includes a method for manufacturing an oligomeric AI surface antigen comprising culturing a S. 

20 pneumoniae bacterium that expresses the oligomeric AI protein and isolating the expressed 

oligomeric AI protein from the S. pneumoniae bacteria. The AI protein may be collected from 
secretions into the supernatant or it may be purified from the bacterial surface. The method may 
further comprise purification of the expressed AI protein. Preferably, the AI protein is in a 
hyperoligomeric form. 

25 The oligomeric, pilus-like structures may be isolated or purified from bacterial cultures 

overexpressing an AI surface protein. The invention therefore includes a method for manufacturing 
an S. pneumoniae oligomeric Adhesin Island surface antigen comprising culturing a S. pneumoniae 
bacterium adapted for increased AI protein expression and isolation of the expressed oligomeric 
Adhesin Island protein from the S. pneumoniae bacteria. The AI protein may be collected from 

30 secretions into the supernatant or it may be purified from the bacterial surface. The method may 

further comprise purification of the expressed Adhesin Island protein. Preferably, the Adhesin Island 
protein is in a hyperoligomeric form. 

The S. pneumoniae bacteria are preferably adapted to increase AI protein expression by at 
least two (e.g. 9 2, 3, 4, 5, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200) 

35 times wild type expression levels. 

S. pneumoniae bacteria may be adapted to increase AI protein expression by any means 

known in the art, including methods of increasing gene dosage and methods of gene upregulation. 

Such means include, for example, transformation of the S. pneumoniae bacteria with a plasmid 

-82- 



WO 2006/078318 PCT7US2005/027239 

pcT/us.os/e^eBgt, A t t . t . , . lh . . 

encoding the AI protein. The plasmid may include a strong promoter or it may include multiple 
copies of the sequence encoding the AI protein. Optionally, the sequence encoding the AI protein 
within the S. pneumoniae bacterial genome may be deleted. Alternatively, or in addition, the 
promoter regulating the S. pneumoniae Adhesin Island may be modified to increase expression. 
5 The invention further includes S. pneumoniae bacteria which have been adapted to produce 

increased levels of AI surface protein. In particular, the invention includes S. pneumoniae bacteria 
which have been adapted to produce oligomeric or hyperoligomeric AI surface protein. In one 
embodiment, the S. pneumoniae of the invention are inactivated or attenuated to permit in vivo 
delivery of the whole bacteria, with the AI surface protein exposed on its surface. 

1 0 The invention further includes S. pneumoniae bacteria which have been adapted to have 

increased levels of expressed AI protein incorporated in pili on their surface. The S, pneumoniae 
bacteria may be adapted to have increased exposure of oligomeric or hyperoligomeric AI proteins on 
its surface by increasing expression levels of a signal peptidase polypeptide. Increased levels of a 
local signal peptidase expression in Gram positive bacteria (such us LepA in GAS) are expected to 

1 5 result in increased exposure of pili proteins on the surface of Gram positive bacteria. Increased 

expression of a leader peptidase in S. pneumoniae may be achieved by any means known in the art, 
such as increasing gene dosage and methods of gene upregulation. The S. pneumoniae bacteria 
adapted to have increased levels of leader peptidase may additionally be adapted to express increased 
levels of at least one pili protein. 

20 Alternatively, the AI proteins of the invention may be expressed on the surface of a non- 

pathogenic Gram positive bacteria, such as Streptococus gordonii (See, e.g., Byrd et aL, "Biological 
consequences of antigen and cytokine co-expression by recombinant Streptococcus gordonii vaccine 
vectors", Vaccine (2002) 20:2197-2205) or Lactococcus lactis (See, e.g., Mannam et al., "Mucosal 
Vaccine Made from Live, Recombinant Lactococcus lactis Protects Mice against Pharangeal Infection 

25 with Streptococcus pyogenes" Infection and Immunity (2004) 72(6): 3 444-3 450). As used herein, 
non-pathogenic Gram positive bacteria refer to Gram positive bacteria which are compatible with a 
human host subject and are not associated with human pathogenisis. Preferably, the non-pathogenic 
bacteria are modified to express the AI surface protein in oligomeric, or hyper-oligomeric form. 
Sequences encoding for an AI surface protein and, optionally, an AI sortase, may be integrated into 

30 the non-pathogenic Gram positive bacterial genome or inserted into a plasmid. The non-pathogenic 
Gram positive bacteria may be inactivated or attenuated to facilitate in vivo delivery of the whole 
bacteria, with the AI surface protein exposed on its surface. Alternatively, the AI surface protein may 
be isolated or purified from a bacterial culture of the non-pathogenic Gram positive bacteria. For 
example, the AI surface protein may be isolated from cell extracts or culture supernatants. 

35 Alternatively, the AI surface protein may be isolated or purified from the surface of the non- 
pathogenic Gram positive bacteria. 

The non-pathogenic Gram positive bacteria may be used to express any of the S. pneumoniae 

Adhesin Island proteins described herein. The non-pathogenic Gram positive bacteria are transformed 
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to express an Adhesin Island surface protein. Preferably, the non-pathogenic Gram positive bacteria 
also express at least one Adhesin Island sortase. The AI transformed non-pathogenic Gram positive 
bacteria of the invention may be used to prevent or treat infection with pathogenic S. pneumoniae. 

Figures 190 A and B, and 193-195 provide examples of three methods successfully practiced 
5 by applicants to purify pili from S. pneumoniae TIGR4. 

Immunogenic Compositions 

The Gram positive bacteria AI proteins described herein are useful in immunogenic 
compositions for the prevention or treatment of Gram positive bacterial infection. For example, the 

10 GBS AI surface proteins described herein are useful in immunogenic compositions for the prevention 
or treatment of GBS infection. As another example, the GAS AI surface proteins described herein 
may be useful in immunogenic compositions for the prevention or treatment of GAS infection. As 
another example, the S. pneumoniae AI surface proteins may be useful in immunogenic cojmpositions 
for the prevention or treatment ofS. pneumoniae infection. 

1 5 Gram positive bacteria AI surface proteins that can provide protection across more than one 

serotype or strain isolate may be used to increase immunogenic effectiveness. For example, a 
particular GBS AI surface protein having an amino acid sequence that is at least 50% (i.e., at least 
55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%) homologous to 
the particular GBS AI surface protein of at least 2 (i.e., at least 3, 4, 5, 6, 7, 8, 9, 10, or more) other 

20 GBS serotypes or strain isolates may be used to increase the effectiveness of such compositions. 

As another example, fragments of Gram positive bacteria AI surface proteins that can provide 
protection across more than one serotype or strain isolate may be used to increase immunogenic 
effectiveness. Such a fragment may be identified within a consensus sequence of a full length amino 
acid sequence of a Gram positive bacteria AI surface protein. Such a fragment can be identified in the 

25 consensus sequence by its high degree of homology or identity across multiple (i.e, at least 3, 4, 5, 6, 
7, 8, 9, 10, or more) Gram positive bacteria serotypes or strain isolates. Preferably, a high degree of 
' homology is a degree of homology of at least 90% (i.e., at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or 100%) across Gram positive bacteria serotypes or strain isolates. 
Preferably, a high degree of identity is a degree of identity of at least 90% (i.e., at least 90%, 91%, 

30 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%) across Gram positive bacteria serotypes or 
strain isolates. In one embodiment of the invention, such a fragment of a Gram positive bacteria AI 
surface protein may be used in the immunogenic compositions. 

In addition, the AI surface protein oligomeric pilus structures may be formulated or purified 
for use in immunization. Isolated AI surface protein oligomeric pilus structures may also be used for 

35 immunization. 

The invention includes an immunogenic composition comprising a first Gram positive 
bacteria AI protein and a second Gram positive bacterial AI protein. One or more of the AI proteins 
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may be a surfJd^lM^? Suffi ! stirfice : |rdlein^ may contain an LPXTG motif or other sortase 
substrate motif. 

The first and second AI proteins may be from the same or different genus or species of Gram 
positive bacteria. If within the same species, the first and second AI proteins may be from the same or 
5 different AI subtypes. If two AIs are of the same subtype, the AIs have the same numerical 

designation. For example, all AIs designated as AI- 1 are of the same AI subtype. If two AIs are of a 
different subtype, the AIs have different numerical designations. For example, AI-1 is of a different 
AI subtype from AI-2, AI-3, AI-4, etc. Likewise, AI-2 is of a different AI subtype from AI-1, AI-3, 
and AI-4, etc. 

1 0 For example, the invention includes an immunogenic composition comprising one or more 

GBS AI-1 proteins and one'or more GBS AI-2 proteins. One or more of the AI proteins may be a 
surface protein. Such surface proteins may contain an LPXTG motif (such as LPXTG (SEQ ID NO: 
122)) and may bind fibrinogen, fibronectin, or collagen. One or more of the AI proteins may be a 
sortase. The GBS AI-1 proteins may be selected from the group consisting of GBS 80, GBS 104, 

15 GBS 52, SAG0647 and SAG0648. Preferably, the GBS AI-1 proteins include GBS 80 or GBS 104. 

The GBS AI-2 proteins maybe selected from the group consisting of GBS 67, GBS 59, GBS 
150, SAG1405, SAG1406, 01520, 01521, 01522, 01523, 01523, 01524 and 01525. In one 
embodiment, the GBS AI-2 proteins are selected from the group consisting of GBS 67, GBS 59, GBS 
150, SAG 1405, and SAG1406. In another embodiment, the GBS AI-2 proteins may be selected from 

20 the group consisting of 01520, 01521, 01522, 01523, 01523, 01524 and 01525. Preferably, the GBS 
AI-2 protein includes GBS 59 or GBS 67. 

As another example, the invention includes an immunogenic composition comprising one or 
more of any combination of GAS AI-1, GAS AI-2, GAS AI-3, or GAS AI-4 proteins. One or more of 
the GAS AI proteins may be a sortase. The GAS AI-1 proteins may be selected from the group 

25 consisting of M6_Spy01 56, M6_Spy0157, M6_Spy0158, M6_Spy0159, M6_Spy0160, M6_Spy0161, 
CDC SS 410_fimbrial, ISS3650_fimbrial, and DSM2071 Jfimbrial. Preferably, the GAS AI-1 
proteins are selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC 
SS 410_fxmbrial, IS S3 65 0__fimbrial, and DSM2071 Jimbrial. 

The GAS AI-2 proteins may be selected from the group consisting of Spy0124, GAS15, 

30 Spy0127, GAS 16, GAS 17, GAS 18, Spy0131, Spy0133, and GAS20. Preferably, the GAS AI-2 
proteins are selected from the group consisting of GAS 15, GAS 16, and GAS 18. 

The GAS AI-3 proteins may be selected from the group consisting of SpyM3_0097, 
SpyM3_0098, SpyM3_0099, SpyM3_0100, SpyM3_0101, SpyM3_0102, SpyM3J)103, 
SpyM3_0104, SPs0099, SPsOlOO, SPsOlOl, SPs0102, SPs0103, SPs0104, SPs0105, SPs0106, orf77, 

35 orf78, orf79, orf80, orf81, orf82, orf83, orf84, spyM 18_01 25, spyM18_0126, spyM18_0127, 

spyM18_0128, spyM18_0129, spyM18__0130, spyM18_0131, spyM18J)132, SpyoM01000156, 

SpyoM01000155, SpyoMO 1000 154, SpyoMO 1000 153, SpyoMO 1000 152, SpyoM01000151, 

SpyoMO 1000 150, SpyoMO 1000 149, ISS3040Jimbrial, ISS3776_fimbrial, and ISS4959Jimbrial. In 
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one embodiment the GAS AI-3 proteins are selected from the group consisting of SpyM3_0097, 
SpyM3_0098, SpyM3J)099, SpyM3_0100, SpyM3_0101, SpyM3_0102, SpyM3J)103, and 
SpyM3_0104. In another embodiment, the GAS AI-3 proteins are selected from the group consisting 
of SPs0099, SPsOlOO, SPsOlOl, SPs0102, SPs0103, SPs0104, SPs0105, and SPs0106. In yet another 
5 embodiment, the GAS AI-3 proteins are selected from the group consisting of orf77, orf78, orf79, 
orf80, orfBl, orf82, orf83, and orf84. In a further embodiment, the GAS AI-3 proteins are selected 
from the group consisting of spyM18__0125, spyM18J)126, spyM18_0127, spyM18_0128, 
spyM18J)129, spyM18 J)130, spyM18J)13 1, and spyM18_0132. In yet another embodiment the 
GAS AI-3 proteins are selected from the group consisting of SpyoMOlOOOl 56, SpyoMOlOOOl 55, 
10 SpyoMOlOOOl 54, SpyoMOlOOOl 53, SpyoMOlOOOl 52, SpyoM01000151, SpyoMOlOOOl 50, and 
SpyoM01000149. 

The GAS AI-4 proteins may be selected from the group consisting of 19224133, 19224134, 
19224135, 19224136, 19224137, 19224138, 19224139, 19224140, 19224141, 2001 029 6_fimbrial, 
20020069_fimbriai, CDC SS 635jTimbrial, ISS4883_fimbrial, and ISS4538 JTimbrial. Preferably, the 
15 GAS-AI4 proteins are selected from the group consisting of 19224134, 19224135, 19224137, 
19224139, 19224141, 2001 0296 Jfrmbrial, 2 0020069 Jimbrial, CDC SS 635_fimbrial, 
ISS4883_fimbrial, and ISS4538_fimbriaL 

As yet another example, the invention includes an immunogenic composition comprising one 
or more of any combination of S. pneumonaie from TIGR4, S. pneumonaie strain 670, S. pneumonaie 
20 from 19A Hungary 6, S. pneumonaie from 6B Finland 12, S. pneumonaie from 6B Spain 2, & 

pneumonaie from 9V Spain 3, S. pneumonaie from 14 CSR 10, S. pneumonaie from 19F Taiwan 14, 
S. pneumonaie from 23F Taiwan 15, or S. pneumonaie from 23F Poland 16 AI proteins. One or more 
of the AI proteins may be a surface protein. Such surface proteins may contain an LPXTG motif 
(such as LPXTG (SEQ ID NO: 122)) and may bind fibrinogen, fibronectin, or collagen. One or more 
25 . of the AI proteins may be a sortase. 

The S. pneumonaie from TIGR4 AI proteins may be selected from the group consisting of 
SP0462, SP0463, SP0464, SP0465, SP0466, SP0467, SP0468. Preferably, the S. pneumonaie from 
TIGR4 AI proteins include SP0462, SP0463, or SP0464. 

The S. pneumonaie strain 670 AI proteins may be selected from the group consisting of 
30 Orfl_670, Orf3_670, Orf4_670, Orf5_670, Orf6_670, Orf7j570, and Orf8__670. Preferably, the S. 
pneumonaie strain 670 AI proteins include Orf3_670, Orf4_670, or Orf5_670. 

The S. pneumonaie from 19A Hungary 6 AI proteins may be selected from the group 
consisting of ORF2 19AH, ORF3_19AH, ORF4J9AH, ORF5__19AH, ORF6_19AH, ORF7_19AH i 
or ORF8_19AH. 

35 The S. pneumonaie from 6B Finland 12 AI proteins may be selected from the group 

consisting of ORP2_6BF, ORF3J5BF, ORF4_6BF, ORF5_6BF, ORF6J5BF, ORF7_6BF , or 
ORF8 6BF. 
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The & pneumonaie from 6B Spain 2 AI proteins may be selected from the group consisting of 
ORF26BSP, ORF3_6BSP, ORF4J5BSP, ORF5j5BSP, ORF6J5BSP, ORF7J5BSP , or 
ORF8_6BSP. 

The S. pneumonaie from 9V Spain 3 AI proteins may be selected from the group consisting of 
5 ORF2_9VSP, ORF3_9VSP, ORF4__9VSP, ORF5_9VSP, ORF6_9VSP, ORF7_9VSP , or 
ORF8_9VSP. 

The S. pneumonaie from 14 CSR 10 AI proteins may be selected from the group consisting of 
ORF2__14CSR, ORF3__14CSR, ORF4_14CSR, ORF5_14CSR, ORF6J4CSR, ORF7__14CSR , or 
ORF8_14CSR 

10 The S. pneumonaie from 19F Taiwan 14 AI proteins may be selected from the group 

consisting of ORF2_19FTW, ORF3 19FTW, ORF4_19FTW, ORF5_19FTW, ORF6_19FTW, 
ORF7_19FTW , or ORF819FTW. 

The S. pneumonaie from 23F Taiwan 1 5 AI proteins may be selected from the group 
consisting of ORF2_23FTW, ORF3J23FTW, ORF4_23FTW, ORF5_23FTW, ORF6_23FTW, 
1 5 ORF7_23FTW > or ORF8_23FTW. 

The S: pneumonaie from 23F Poland 16 AI proteins may be selected from the group 
consisting of ORF2_23FP, ORF3_23FP, ORF4__23FP, ORF5_23FP, ORF6__23FP, ORF7_23FP , or 
ORF8_23FP. 

Preferably, the Gram positive bacteria AI proteins included in the immunogenic compositions 

20 of the invention can provide protection across more than one serotype or strain isolate. For example, 
the immunogenic composition may comprise a first AI protein, wherein the amino acid sequence of 
said AI protein is at least 90% (i.e., at least 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100%) 
homologous to the amino acid sequence of a second AI protein, and wherein said first AI protein and 
said second AI protein are derived from the genomes of different serotypes of a Gram positive 

25 bacteria. The first AI protein may also be homologous to the amino acid sequence of a third AI 

protein, such that the first AI protein, the second AI protein and the third AI protein are derived from 
the genomes of different serotypes of a Gram positive bacteria. The first AI protein may also be 
homologous to the amino acid sequence of a fourth AI protein, such that the first AI protein, the 
second AI protein and the third AI protein are derived from the genomes of different serotypes of a 

30 Gram positive bacteria. 

For example, preferably, the GBS AI proteins included in the immunogenic compositions of 
the invention can provide protection across more than one GBS serotype or strain isolate. For 
example, the immunogenic composition may comprise a first GBS AI protein, wherein the amino acid 
sequence of said AI protein is at least 90% (Le^ at least 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100%) 

35 homologous to the amino acid sequence of a second GBS AI protein, and wherein said first AI protein 

and said second AI protein are derived from the genomes of different GBS serotypes. The first GBS 

AI protein may also be homologous to the amino acid sequence of a third GBS AI protein, such that 

the first AI protein, the second AI protein and the third AI protein are derived from the genomes of 
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different GBS serotypes. The first AI protein may also be homologous to the amino acid sequence of 
a fourth GBS AI protein, such that the first AI protein, the second AI protein and the third AI protein 
are derived from the genomes of different GBS serotypes. 

The first AI protein may be selected from an AI-1 protein or an AI-2 protein. For example, 
5 the first AI protein may be a GBS AI-1 surface protein such as GBS 80. The amino acid sequence of 
GBS 80 from GBS serotype V, strain isolate 2603 is greater than 90% homologous to the GBS 80 
amino acid sequence from GBS serotype III, strain isolates NEM316 and COH1 and the GBS 80 
amino acid sequence from GBS serotype la, strain isolate A909. 

As another example, the first AI protein may be GBS 104. The amino acid sequence of GBS 
10 104 from GBS serotype V, strain isolate 2603 is greater than 90% homologous to the GBS 104 amino 
acid sequence from GBS serotype III, strain isolates NEM3 16 and COH1, the GBS 104 amino acid 
sequence from GBS serotype la, strain isolate A909, and the GBS 104 amino acid sequence serotype 
II, strain isolate 18RS21. 

Table 12 provides the amino acid sequence identity of GBS 80 and GBS 104 across GBS 
15 serotypes la, lb, II, III, V, and VIIL The GBS strains in which genes encoding GBS 80 and GBS 104 
were identified share, on average, 99.88 and 99.96 amino acid sequence identity, respectively. This 
high degree of amino acid identity indicates that an immunogenic composition comprising a first 
protein of GBS 80 or GBS 104 may provide protection across more than one GBS serotype or strain 
isolate. 

20 Table 12. Conservation of GBS 80 and GBS 104 amino acid sequences 



Serotype 


Strains 


GBS 80 


GBS 104 






cGH 


%AA identity 


cGH 


%AA identity 


la 


090 


+ 


99.79 


+ 


100.00 


A909 


+ 


100.00 


+ 


100.00 


515 










DK1 










DK8 










Davis 










lb 


7357b 




100.00 


+ 




H36B 










n 


18RS21 






+ 


100.00 


DK21 










III 


NEM316 




100.00 


+ 


100.00 


COH31 


+ 


100.00 


+ 




D136 


+ 


100.00 


+ 




M732 


+ 


100.00 


+ 


99.88 


com 


+ 


99.79 


+ 


99.88 | 


M781 


+ 


99.79 


+ 


99.88 i 


No type 


CJB110 


+ 


99.37 




100.00 


1169NT 










V 


CJB111 


+ 


100.00 


+ 


100.00 


2603 


+ 


100.00 


+ 


100.00 


vni 


JM130013 


+ 


99.79 




100.00 


SMU014 


4- 


100.00 
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Serotype Strains 



GBS 80 



GBS 104 



PCT/US2005/027239 



cGH 



%AA identity 



cGH 



%AA identity 



total 14/22 



99.88+/-0.19 



15/22 



99.96 +/-0.056 



As another example, the first AI protein may be an AI-2 protein such as GBS 67. The amino 
acid sequence of GBS 67 from GBS serotype V, strain isolate 2603 is greater than 90% homologous 
to the GBS 67 amino acid sequence from GBS serotype III, strain isolate NEM3 16, the GBS 67 amino 
acid sequence from GBS serotype lb, strain isolate H36B, and the GBS 67 amino acid sequence from 
GBS serotype II, strain isolate 17RS21. 

As another example, the first AI protein may be an AI-2 protein such as spbl . The amino 
acid sequence of spbl from GBS serotype III, strain isolate COH1 is greater than 90% homologous to 
the spbl amino acid sequence from GBS serotype Ia, strain isolate A909. 

As yet another example, the first AI protein may be an AI-2 protein such as GBS 59. The 
amino acid sequence of GBS 59 from GBS serotype II, strain isolate 18RS21 is 100% homologous to 
the GBS 59 amino acid sequence from GBS serotype V, strain isolate 2603. The amino acid sequence 
of GBS 59 from GBS serotype V, strain isolate CJB1 1 1 is 98% homologous to the GBS 59 amino 
acid sequence from GBS serotype III, strain isolate NEM316. 

The compositions of the invention may also be designed to include Gram positive AI proteins 
from divergent serotypes or strain isolates, i.e., to include a first AI protein which is present in one 
collection of serotypes or strain isolates of a Gram positive bacteria and a second AI protein which is 
present in those serotypes or strain isolates not represented by the first AI protein. 

For example, the invention may include an immunogenic composition comprising a first and 
second Gram positive bacteria AI protein, wherein a polynucleotide sequence encoding for the full 
length sequence of the first AI protein is not present in a similar Gram positive bacterial genome 
comprising a polynucleotide sequence encoding for the second AI protein. 

The compositions of the invention may also be designed to include AI proteins from 
divergent GBS serotypes or strain isolates, i.e., to include a first AI protein which is present in one 
collection of GBS serotypes or strain isolates and a second AI protein which is present in those 
serotypes or strain isolates not represented by the first AI protein. 

For example, the invention may include an immunogenic composition comprising a first and 
second GBS AI protein, wherein a polynucleotide sequence encoding for the full length sequence of 
the first GBS AI protein is not present in a genome comprising a polynucleotide sequence encoding 
for the second GBS AI protein. For example, the first AI protein could be GBS 80 (such as the GBS 
80 sequence from GBS serotype V, strain isolate 2603). As previously discussed (and depicted in 
Figure 2), the sequence for GBS 80 in GBS sertoype II, strain isolate 18RS21 is disrupted. In this 
instance, the second AI protein could be GBS 104 or GBS 67 (sequences selected from the GBS 
serotype II, strain isolate 18RS21). 
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Further, trie the invention may include an immunogenic composition comprising a first and 
second GBS AI protein, wherein the first GBS AI protein has detectable surface exposure on a first 
GBS strain or serotype but not a second GBS strain or serotype and the second GBS AI protein has 
detectable surface exposure on a second GBS strain or serotype but not a first GBS strain or serotype. 
5 For example, the first AI protein could be GBS 80 and the second AI protein could be GBS 67. As 

seen in Table 15, there are some GBS serotypes and strains that have surface exposed GBS 80 but that 
do not have surface exposed GBS 67 and vice versa. An immunogenic composition comprising a 
GBS 80 and a GBS 67 protein may provide protection across a wider group of GBS strains and 
serotypes. 
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Tab le 15: Antigen surface exposure of GBS 80, GBS 322, GBS 104, and GB S 67 
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Alternatively, the invention may include an immunogenic composition comprising a first and 
second Gram positive bacteria AI protein, wherein the polynucleotide sequence encoding the 
sequence of the first AI protein is less than 90 % (i.e., less than 90, 88, 86, 84, 82, 80, 78, 76, 74, 72, 
70, 65, 60, 55, 50, 45, 40, 35 or 30 percent) homologous than the corresponding sequence in the 
genome of the second AI protein. 
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The invention may include an immunogenic composition comprising a first and second GBS 
AI protein, wherein the polynucleotide sequence encoding the sequence of the first GBS AI protein is 
less than 90 % (i.e., less than 90, 88, 86, 84, 82, 80, 78, 76, 74, 72, 70, 65, 60, 55, 50, 45, 40, 35 or 30 
percent) homologous than the corresponding sequence in the genome of the second GBS AI protein. 
5 For example, the first GBS AI protein could be GBS 67 (such as the GBS 67 sequence from GBS 

serotype lb, strain isolate H36B). As shown in Figures 2 and 4, the GBS 67 sequence for this strain is 
less than 90% homologous (87%) to the corresponding GBS 67 sequence in GBS serotype V, strain 
isolate 2603. In this instance, the second GBS AI protein could then be the GBS 80 sequence from 
GBS serotype V, strain isolate 2603. 

10 An example immunogenic composition of the invention may comprise adhesin island proteins 

GBS 80, GBS 104, GBS 67, and GBS 59, and non-AI protein GBS 322. FACS analysis of different 
GBS strains demonstrates that at least one of these five proteins is always found to be expressed on 
the surface of GBS bacteria. An initial FACS analysis of 70 strains of GBS bacteria, obtained from 
the CDC in the United States (33 strains), ISS in Italy (17 strains), and Houston/Harvard (20 strains), 

15 detected surface exposure of at least one of GBS 80, GBS 104, GBS 322, GBS 67, or GBS 59 on the 
surface of the GBS bacteria. Figure 227 provides the FACS data obtained for surface exposure of 
GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 on each of 37 GBS strains. Figure 228 provides 
the FACS data obtained for surface exposure of GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 
on each of 41 GBS strains obtained from the CDC. As can be seen from Figures 227 and 228, each 

20 GBS strain had surface expression of at least one of GBS 80, GBS 104, GBS 67, GBS 322, and GBS 
59. The surface exposure of at least one of these proteins on each bacterial strain indicates that an 
immunogenic composition comprising these proteins will provide wide protection across GBS strains 
and serotypes. 

The surface exposed GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 proteins are also 
25 present at high levels as determined by FACS. Table 49 summarizes the FACS results for the initial 
70 GBS strains examined for GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 surface expression. 
A protein was designated as having high levels of surface expression of a protein if a five-fold shift in 
fluorescence was observed when using antibodies for the protein relative to preimmune control serum. 
Table 49: Exposure Levels of GBS 80, GBS 104, GBS 67, GBS 322, and GBS 59 on GBS Strains 



5-fold shift in 
fluorescence 
by FACS 


GBS 80 


GBS 104 


GBS 67 


GBS 59 


GBS 322 
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14/70 


49/70 


46/70 


33/70 


24% 


20% 


70% 


66% 


47% 



30 Table 50 details which of the surface proteins is highly expressed on the different GBS serotype. 



Table 50: High Levels of Surface Protein Expression on GBS Serotypes 
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Alternatively, the immunogenic composition of the invention may include GBS 80, GBS 104, 
GBS 67, and GBS 322. Assuming that protein antigens that are highly accessible to antibodies confer 
100% protection with suitable adjuvants, an immunogenic composition containing GBS 80, GBS 104, 
GBS 67, GBS 59 and GBS 322 will provide protection for 89% of GBS strains and serotypes, the 
5 same percentage as an immunogenic composition containing GBS 80, GBS 104, GBS 67, and GBS 
322 proteins. See Figure 229. However, it may be preferable to include GBS 59 in the composition 
to increase its immunogenic strength. As seen from Table 50, GBS 59 is highly expressed on the 
surface two-thirds of GBS bacteria examined by FACS analysis, unlike GBS 80, GBS 104, and GBS 
322, which are highly expressed in less than half of GBS bacteria examined. GBS 59 

10 opsonophagocytic activity is also comparable to that of a mix of GBS 322, GBS 104, GBS 67, and 
GBS 80 proteins. See Figure 230. 

By way of another example, preferably, the GAS AI proteins included in the immunogenic 
compositions of the invention can provide protection across more than one GAS serotype or strain 
isolate. For example, the immunogenic composition may comprise a first GAS AI protein, wherein 

15 the amino acid sequence of said AI protein is at least 90% (i.e., at least 90, 91, 92, 93, 94, 95, 96, 97, 
98, 99 or 100%) homologous to the amino acid sequence of a second GAS AI protein, and wherein 
said first AI protein and said second AI protein are derived from the genomes of different GAS 
serotypes. The first GAS AI protein may also be homologous to the amino acid sequence of a third 
GAS AI protein, such that the first AI protein, the second AI protein and the third AI protein are 

20 derived from the genomes of different GAS serotypes. The first AI protein may also be homologous 
to the amino acid sequence of a fourth GAS AI protein, such that the first AI protein, the second AI 
protein and the third AI protein are derived from the genomes of different GAS serotypes. 

The compositions of the invention may also be designed to include GAS AI proteins from 
divergent serotypes or strain isolates, i.e., to include a first AI protein which is present in one 

25 collection of serotypes or strain isolates of a GAS bacteria and a second AI protein which is present in 
those serotypes or strain isolates not represented by the first AI protein. 

For example, the first AI protein could be a prtF2 protein (such as the 19224141 protein from 
GAS serotype M12, strain isolate A735). As previously discussed (and depicted in Figure 164), the 
sequence for a prtF2 protein is not present in GAS AI types 1 or 2. In this instance, the second AI 

30 protein could be collagen binding protein M6_Spy0159 (from M6 isolate (MGAS10394), which 
comprises an AI-1) or GAS 15 (from Ml isolate (SF370), which comprises an AI-2). 

Further, the invention may include an immunogenic composition comprising a first and 
second GAS AI protein, wherein the first GAS AI protein has detectable surface exposure on a first 
GAS strain or serotype but not a second GAS strain or serotype and the second GAS AI protein has 

35 detectable surface exposure on a second GAS strain or serotype but not a first GAS strain or serotype. 

The invention may include an immunogenic composition comprising a first and second GAS 
AI protein, wherein the polynucleotide sequence encoding the sequence of the first GAS AI protein is 
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percent) homologous than the corresponding sequence in the genome of the second GAS AI protein. 
Preferably the first and second GAS AI proteins are subunits of the pilus. More preferably the first 
and second GAS AI proteins are selected from the major pilus forming proteins (i.e., M6_Spy0160 
5 from M6 strain 10394, SPy0128 from Ml strain SF370, SpyM3 J)100 from M3 strain 315, SPs0102 
from M3 strain SSI, orf80 from M5 isolate Manfredo, spyM18_0128 from Ml 8 strain 8232, 
SpyoM01000153 from M49 strain 591, 19224137 from M12 strain A735, fimbrial structural subunit 
from M77 strain ISS4959, fimbrial structural subunit from M44 strain ISS3776, fimbrial structural 
subunit from M50 strain ISS3776 ISS 4538, fimbrial structural subunit from M12strain CDC SS635, 

10 fimbrial structural subunit from M23 strain DSM2071, fimbrial structural subunit from M6 strain 

CDC SS410). Table 45 provides the percent identity between the amino acidic sequences of each of 
the main pilus forming subunits from GAS AI-1, AI-2, AI-3, and AI-4 representative strains (i.e., 
M6_Spy0160 from M6 strain 10394, SPy0128 from Ml strain SF370, SpyM3_0100 fromM3 strain 
315, SPs0102 from M3 strain SSI, orf80 from M5 isolate Manfredo, spyM18J)128 from M18 strain 

15 8232, SpyoM01000153 from M49 strain 591, 19224137 from M12 strain A735, Fimbrial structural 
subunit from M77 strain ISS4959, fimbrial structural subunit from M44 strain ISS3776, fimbrial 
structural subunit from M50 strain ISS3776 ISS 4538, fimbrial structural subunit from M12strain 
CDC SS635, fimbrial structural subunit from M23 strain DSM2071, fimbrial structural subunit from 
M6 strain CDC SS410). 

20 Table 45: Comparison of Amino Acid Sequences of Major Pilus Proteins in the Four GAS 
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For example, the first main pilus subunit may be selected from bacteria of GAS serotype M6 
strain 10394 and the second main pilus subunit may be selected from bacteria of GAS serotype Ml 
strain 370. As can be seen from Table 45, the main pilus subunits encoded by these strains of bacteria 
share only 23% nucleotide identity. An immunogenic composition comprising pilus main subunits 
5 from each of these strains of bacteria is expected to provide protection across a wider group of GAS 
strains and serotypes. Other examples of main pilus subunits that can be used in combination to 
provide increased protection across a wider range of GAS strains and serotypes include proteins 
encoded by GAS serotype M5 Manfredo isolate and serotype M6 strain 10394, which share 23% 
sequence identity, GAS serotype Ml 8 strain 8232 and serotype Ml strain 370, which share 38% 
10 sequence identity, GAS serotype M3 strain 315 and serotype Ml 2 strain A735, which share 61% 

sequence identity, and GAS serotype M3 strain 315 and serotype M6 strain 10394 which share 25% 
sequence identity. 

As also can be seen from Table 45, the amino acid sequences of the four types of main pilus 
subunits present in GAS are relatively divergent. Figures 198-201 provide further tables comparing 
15 the percent identity of adhesin island-encoded surface exposed proteins for different GAS serotypes 
relative to other GAS serotypes harbouring an adhesin island of the same or a different subtype (GAS 
AI-1, GAS AI-2, GAS AI-3, and GAS AI-4). See also further discussion below. 

Immunizations with the Adhesin Island proteins of the invention are discussed further in the 
Examples. 

20 1 Co-expression of GBS Adhesin Island proteins and role of GBS AI proteins in surface presentation 
In addition to the use of the GBS adhesin island proteins for cross strain and cross serotype 
protection, Applicants have identified interactions between adhesin island proteins which appear to 
affect the delivery or presentation of the surface proteins on the surface of the bacteria. 

In particular, Applicants have discovered that surface exposure of GBS 104 is dependent on 
25 the concurrent expression of GBS 80. As discussed further in Example 2, reverse transcriptase PCR 
analysis of AI-1 shows that all of the AI genes are co -transcribed as an operon. Applicants 
constructed a series of mutant GBS containing in frame deletions of various AI-1 genes. (A 
schematic of the GBS mutants is presented in Figure 7). FACS analysis of the various mutants 
comparing mean shift values using anti-GBS 80 versus anti-GBS 104 antibodies is presented in 
30 Figure 8. Removal of the GBS 80 operon prevented surface exposure of GBS 104; removal of the 
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ibsure of GBS 80. While not being limited to a specific 



theory, it is thought that GBS 80 is involved in the transport or localization of GBS 104 to the surface 
of the bacteria. The two proteins may be oligomerized or otherwise associated. It is possible that this 
association involves a conformational change in GBS 104 that facilitates its transition to the surface of 
the GBS bacteria. 

Pili structures that comprise GBS 104 appear to be of a lower molecular weight than pili 
structures lacking GBS 104. Figure 68 shows that polyclonal anti-GBS 104 antibodies (see lane 
marked a- 104 POLIC.) cross-hybridize with smaller structures than do polyclonal anti-GBS 80 
antibodies (see lane marked a-GBS 80 POLIC.). 

In addition, Applicants have shown that removal of GBS 80 can cause attenuation, further 
suggesting the protein contributes to virulence. As described in more detail in Example 3, the LD 50 's 
for the A80 mutant and the A80, A104 double mutant were reduced by an order of magnitude 
compared to wildtype and A 104 mutant. 

The sortases within the adhesin island also appear to play a role in localization and 
presentation of the surface proteins. As discussed further in Example 4, FACS analysis of various 
sortase deletion mutants showed that removal of sortase SAG0648 prevented GBS 104 from reaching 
the surface and slightly reduced the surface exposure of GBS 80. When sortase SAG0647 and sortase 
SAG0648 were both knocked out, neither GBS 80 nor GBS 104 were surface exposed. Expression of 
either sortase alone was sufficient for GBS 80 to arrive at the bacterial surface. Expression of 
SAG0648, however, was required for GBS 104 surface localization. 

Accordingly, the compositions of the invention may include two or more AI proteins, wherein 
the AI proteins are physically or chemically associated. For example, the two AI proteins may form 
an oligomer. In one embodiment, the associated proteins are two AI surface proteins, such as GBS 80 
and GBS 104. The associated proteins may be AI surface proteins from different adhesin islands, 
including host cell adhesin island proteins if the AI surface proteins are expressed in a recombinant 
system. For example, the associated proteins may be GBS 80 and GBS 67. 
Adhesin Island proteins from other Gram positive bacteria 

Applicants' identification and analysis of the GBS adhesin islands and the immunological and 
biological functions of these AI proteins and their pilus structures provides insight into similar 
structures in other Gram positive bacteria. 

As discussed above, "Adhesin Island" or "AI" refers to a series of open reading frames within 
a bacterial genome that encode for a collection of surface proteins and sortases. An Adhesin Island 
may encode for amino acid sequences comprising at least one surface protein. The Adhesin Island 
may encode at least one surface protein. Alternatively, an Adhesin Island may encode for at least two 
surface proteins and at least one sortase. Preferably, an Adhesin Island encodes for at least three 
surface proteins and at least two sortases. One or more of the surface proteins may include an 
LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. One or more AI 
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surface proteins may participate in the lormation of a pilus structure on the surface of the Gram 

positive bacteria. 

Gram positive adhesin islands of the invention preferably include a divergently transcribed 
transcriptional regulator. The transcriptional regulator may regulate the expression of the AI operon. 
5 The invention includes a composition comprising one or more Gram positive bacteria AI 

surface proteins. Such AI surface proteins may be associated in an oligomeric or hyperoligomeric 
structure. 

Preferred Gram positive adhesin island proteins for use in the invention may be derived from 
Staphylococcus (such as S. aureus), Streptococcus (such as S. agalactiae (GBS), S. pyogenes (GAS), 
10 S. pneumonaie, S. mutans), Enterococcus (such as E.faecalis and E. faecium), Clostridium (such as C. 
difficile), Listeria (such as E monocytogenes) and Cory neb acterium (such as C. diphtheria). 

One or more of the Gram positive AI surface protein sequences typically include an LPXTG 
motif or other sortase substrate motif. Gram positive AI surface proteins of the invention may affect 
the ability of the Gram positive bacteria to adhere to and invade epithelial cells. AI surface proteins 
15 may also affect the ability of Gram positive bacteria to translocate through an epithelial cell layer. 

Preferably, one or more AI surface proteins are capable of binding to or otherwise associating with an 
epithelial cell surface. Gram positive AI surface proteins may also be able to bind to or associate with 
fibrinogen, fibronectin, or collagen. 

Gram positive AI sortase proteins are predicted to be involved in the secretion and anchoring 
20 of the LPXTG containing surface proteins. A Gram positive bacteria AI may encode for at least one 
surface exposed protein. The Adhesin Island may encode at least one surface protein. Alternatively, 
a Gram positive bacteria AI may encode for at least two surface exposed proteins and at least one 
sortase. Preferably, a Gram positive AI encodes for at least three surface exposed proteins and at least 
two sortases. 

25 Gram positive AI surface proteins may be covalently attached to the bacterial cell wall by 

membrane- associated transpeptidases, such as an AI sortase. The sortase may function to cleave the 
surface protein, preferably between the threonine and glycine residues of an LPXTG motif. The 
sortase may then assist in the formation of an amide link between the threonine carboxyl group and a 
cell wall precursor such as lipid II. The precursor can then be incorporated into the peptidoglycan via 

30 the transglycoslylation and transpeptidation reactions of bacterial wall synthesis. See Comfort et aL, 
Infection & Immunity (2004) 72(5): 2710 - 2722. Typically, Gram positive bacteria AI surface 
proteins of the invention will contain an N-terminal leader or secretion signal to facilitate 
translocation of the surface protein across the bacterial membrane. 

Gram positive bacteria AI surface proteins of the invention may affect the ability of the Gram 

35 positive bacteria to adhere to and invade target host cells, such as epithelial cells. Gram positive 
bacteria AI surface proteins may also affect the ability of the gram positive bacteria to translocate 
through an epithelial cell layer. Preferably, one or more of the Gram positive AI surface proteins are 
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capable of binding to or other associating with an epithelial cell surface. Further, one or more Gram 
positive AI surface proteins may bind to fibrinogen, fibronectin, or collagen protein. 

In one embodiment, the invention includes a composition comprising oligomeric, pilus-like 
structures comprising a Gram positive bacteria AI surface protein. The oligomeric, pilus-like 
5 structure may comprise numerous units of the AI surface protein. Preferably, the oligomeric, pilus- 
like structures comprise two or more AI surface proteins. Still more preferably, the oligomeric, pilus- 
like structure comprises a hyper-oligomeric pilus-like structure comprising at least two (e.g., 2, 3, 4, 
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 150, 200 
or more) oligomeric subunits, wherein each subunit comprises an AI surface protein or a fragment 

10 thereof. The oligomeric subunits may be covalently associated via a conserved lysine within a pilin 

motif. The oligomeric subunits may be covalently associated via an LPXTG motif, preferably, via the 
threonine amino acid residue. 

Gram positive bacteria AI surface proteins or fragments thereof to be incorporated into the 
oligomeric, pilus-like structures of the invention will preferably include one or both of a pilin motif 

15 comprising a conserved lysine residue and an E box motif comprising a conserved glutamic acid 
residue. 

The oligomeric, pilus like structures may be used alone or in the combinations of the 
invention. In one embodiment, the invention comprises a Gram positive bacteria Adhesin Island in 
oligomeric form, preferably in a hyperoligomeric form. 

20 The oligomeric, pilus-like structures of the invention may be combined with one or more 

additional Gram positive AI proteins (from the same or a different Gram positive species or genus). 
In one embodiment, the oligomeric, pilus-like structures comprise one or more Gram positive bacteria 
AI surface proteins in combination with a second Gram positive bacteria protein. The second Gram 
positive bacteria protein may be a known antigen, and need not normally be associated with an AI 

25 protein. 

The oligomeric, pilus-like structures may be isolated or purified from bacterial cultures 
overexpressing a Gram positive bacteria AI surface protein. The invention therefore includes a 
method for manufacturing an oligomeric Adhesin Island surface antigen comprising culturing a Gram 
positive bacteria adapted for increased AI protein expression and isolation of the expressed oligomeric 
30 Adhesin Island protein from the Gram positive bacteria. The AI protein may be collected from 
secretions into the supernatant or it may be purified from the bacterial surface. The method may 
further comprise purification of the expressed Adhesin Island protein. Preferably, the Adhesin Island 
protein is in a hyperoligomeric form. 

Gram positive bacteria are preferably adapted to increase AI protein expression by at least 
35 two (e.g., 2, 3, 4, 5, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200) times 
wild type expression levels. 

Gram positive bacteria may be adapted to increase AI protein expression by means known in 

the art, including methods of increasing gene dosage and methods of gene upregulation. Such means 
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include, for example, transformation of the Gram positive bacteria with a plasmid encoding the AI 
protein. The plasmid may include a strong promoter or it may include multiple copies of the sequence 
encoding the AI protein. Optionally, the sequence encoding the AI protein within the Gram positive 
bacterial genome may be deleted. Alternatively, or in addition, the promoter regulating the Gram 
5 positive Adhesin Island may be modified to increase expression. 

The invention further includes Gram positive bacteria which have been adapted to produce 
increased levels of AI surface protein. In particular, the invention includes Gram positive bacteria 
which have been adapted to produce oligomeric or hyperoligomeric AI surface protein. In one 
embodiment, the Gram positive bacteria of the invention are inactivated or attenuated to permit in 

10 vivo delivery of the whole bacteria, with the AI surface protein exposed on its surface. 

The invention further includes Gram positive bacteria which have been adapted to have 
increased levels of expressed AI protein incorporated in pili on their surface. The Gram positive 
bacteria may be adapted to have increased exposure of oligomeric or hyperoligomeric AI proteins on 
its surface by increasing expression levels of a signal peptidase polypeptide. Increased levels of a 

1 5 local signal peptidase expression in Gram positive bacteria (such us LepA in GAS) are expected to 
result in increased exposure of pili proteins on the surface of Gram positive bacteria. Increased 
expression of a leader peptidase in Gram positive may be achieved by any means known in the art, 
such as increasing gene dosage and methods of gene upregulation. The Gram positive bacteria 
adapted to have increased levels of leader peptidase may additionally be adapted to express increased 

20 levels of at least one pili protein. 

Alternatively, the AI proteins of the invention may be expressed on the surface of a non- 
pathogenic Gram positive bacteria, such as Streptococus gordonii (See, e.g., Byrd et al., "Biological 
consequences of antigen and cytokine co-expression by recombinant Streptococcus gordonii vaccine 
vectors", Vaccine (2002) 20:2197-2205) or Lactoco ecus lactis (See, e.g., Mannam et al., "Mucosal 

25 VaccineMade from Live, Recombinant Lactococcus lactis Protects Mice against Pharangeal Infection 
with Streptococcus pyogenes" Infection and Immunity (2004) 72(6): 3 444-3 450). It has already been 
demonstrated, above, that L. lactis expresses GBS and GAS AI polypeptides in oligomeric form and 
on its surface. 

Alternatively, the oligomeric, pilus-iike structures may be produced recombinantly. If 
30 produced in a recombinant host cell system, the Gram positive bacteria AI surface protein will 

preferably be expressed in coordination with the expression of one or more of the AI sortases of the 
invention. Such AI sortases will facilitate oligomeric or hyperoligomeric formation of the AI surface 
protein subunits. 

Gram positive AI Sortases of the invention will typically have a signal peptide sequence 
35 within the first 70 amino acid residues. They may also include a transmembrane sequence within 50 
amino acid residues of the C terminus. The sortases may also include at least one basic amino acid 
residue within the last 8 amino acids. Preferably, the sortases have one or more active site residues, 
such as a catalytic cysteine and histidine. 
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i} , '" > ' Adhesin island surfa'& ? proteins from two or more Gram positive bacterial genus or species 
may be combined to provide an immunogenic composition for prophylactic or therapeutic treatment 
of disease or infection of two more Gram positive bacterial genus or species. Optionally, the adhesin 
island surface proteins may be associated together in an oligomeric or hyperoligomeric structure. 
5 In one embodiment, the invention comprises an adhesin island surface proteins from two or 

more Streptococcus species. For example, the invention includes a composition comprising a GBS AI 
surface protein and a GAS adhesin island surface protein. As another example, the invention includes 
a composition comprising a GAS adhesin island surface protein and a S. pneumoniae adhesin island 
surface protein. 

10 In one embodiment, the invention comprises an adhesin island surface protein from two or 

more Gram positive bacterial genus. For example, the invention includes a composition comprising a 
Streptococcus adhesin island protein and a Corynebacterium adhesin island protein. 

Examples of AI sequences in several Gram positive bacteria are discussed further below. 
Streptococcus pyogenes (GAS) 

15 As discussed above, Applicants have identified at least four different GAS Adhesin Islands. 

These adhesion islands are thought to encode surface proteins which are important in the bacteria's 
virulence, and Applicants have obtained the first electron micrographs revealing the presence of these 
adhesin island proteins in hyperoligomeric pilus structures on the surface of Group A Streptococcus. 

Group A Streptococcus is a human specific pathogen which causes a wide variety of diseases 

20 ranging from pharyngitis and impetigo through life threatening invasive disease and necrotizing 
fasciitis. In addition, post-streptococcal autoimmune responses are still a major cause of cardiac 
pathology in children. 

Group A Streptococcal infection of its human host can generally occur in three phases. The 
first phase involves attachment and/or invasion of the bacteria into host tissue and multiplication of 

25 the bacteria within the extracellular spaces. Generally this attachment phase begins in the throat or 
the skin. The deeper the tissue level infected, the more severe the damage that can be caused. In the 
second stage of infection, the bacteria secrete a soluble toxin that diffuses into the surrounding tissue 
or even systemically through the vasculature. This toxin binds to susceptible host cell receptors and 
triggers innappropropriate immune responses by these host cells, resulting in pathology. Because the 

30 toxin can diffuse throughout the host, the necrosis directly caused by the GAS toxins may be 

physically located in sites distant from the bacterial infection. The final phase of GAS infection can 
occur long after the original bacteria have been cleared from the host system. At this stage, the host's 
previous immune response to the GAS bacteria due to cross reactivity between epitopes of a GAS 
surface protein, M, and host tissues, such as the heart. A general review of GAS infection can be 

35 found in Principles of Bacterial Pathogeneis, Groisman ed., Chapter 15 (2001). 

In order to prevent the pathogenic effects associated with the later stages of GAS infection, an 
effective vaccine against GAS will preferably facilitate host elimination of the bacteria during the 
initial attachment and invasion stage. 
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' " , " Isolates of Group" A Streptococcus are historically classified according to the M surface 
protein described above. The M protein is surface exposed trypsin- sensitive protein generally 
comprising two polypeptide chains complexed in an alpha helical formation. The carboxyl terminus 
is anchored in the cytoplasmic membrane and is highly conserved among all group A streptococci. 
5 The amino terminus, which extends through the cell wall to the cell surface, is responsible for the 
antigenic variability observed among the 80 or more serotypes of M proteins. 

A second layer of classification is based on a variable, trypsin-resistant surface antigen, 
commonly referred to as the T-antigen. Decades of epidemiology based on M and T serological 
typing have been central to studies on the biological diversity and disease causing potential of Group 

10 A Streptococci. While the M-protein component and its inherent variability have been extensively 
characterized, even after five decades of study, there is still very little known about the structure and 
variability of T-antigens. Antisera to define T types are commercially available from several sources, 
including Sevapharma (http://www.sevapharma.cz/en). 

The gene coding for one form of T-antigen, T-type 6, from an M6 strain of GAS (D741) has 

15 been cloned and characterized and maps to an approximately 1 1 kb highly variable pathogenicity 
island. Schneewind et aL, J Bacteriol. (1990) 172(6):33 10 - 33 17. This island is known as the 
Fibronectin-binding, Collagen-binding T-antigen (FCT) region because it contains, in addition to the 
T6 coding gene (tee6), members of a family of genes coding for Extra Cellular Matrix (ECM) binding 
proteins. Bessen et aL, Infection & Immunity (2002) 70(3): 1 159-1 167. Several of the protein 

20 products of this gene family have been shown to directly bind either fibronectin and/or collagen. See 
Hanski et aL, Infection & Immunity (1992) 60(12):51 19-5125; Talay et aL, Infection & Immunity 
(1992( 60(9):3837-3844; Jaffe et al. (1996) 21(2):373-384; Rocha et aL, Adv Exp Med Biol. (1997) 
418:737-739; Kreikemeyer et aL, J Biol Chem (2004) 279(16): 15850-15859; Podbielski et aL, Mol. 
Microbiol. (1999) 3 1(4): 105 1-64; and Kreikemeyer et aL, Int. J. Med Microbiol (2004) 294(2-3):177- 

25 88. In some cases direct evidence for a role of these proteins in adhesion and invasion has been 
obtained. 

Applicants raised antiserum against a recombinant product of the tee6 gene and used it to 
explore the expression of T6 in M6 strain ISS3650. In immunoblot of mutanolysin extracts of this 
strain, the antiserum recognized, in addition to a band corresponding to the predicted molecular mass 
30 of the tee6 gene product, very high molecular weight ladders ranging in mobility from about 1 00 kDa 
to beyond the resolution of the 3-8% gradient gels used. See Figure 163 A, last lane labeled 
"M6_Tee6." 

This pattern of high molecular weight products is similar to that observed in immunoblots of 
the protein components of the pili identified in Streptococcus agalactiae (described above) and 
35 previously in Corynebacterium diphtheriae. Electron microscropy of strain M6 IS S3 650 with antisera 
specific for the product of tee6 revealed abundant surface staining and long pilus like structures 
extending up to 700 nanometers from the bacterial surface, revealing that the T6 protein, one of the 
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aritigens reco^ized in the original Lancefield serotyping system, is located within a GAS Adhesin 
Island (GAS AI-1) and forms long covalently linked pilus structures. See Figure 1631. 

In addition to the tee6 gene, the FCT region in M6_ISS3650 (GAS AI-1) contains two 
other genes (prtFl and cpa) predicted to code for surface exposed proteins; these proteins are 
5 characterized as containing the cell wall attachment motif LPXTG. Western blot analysis using 
antiserum specific for PrtFl detected a single molecular species with electrophoretic mobility 
corresponding to the predicted molecular mass of the protein and one smaller band of unknown 
origin. Western blot analysis using antisera specific for Cpa recognized a high molecular weight 
covalently linked ladder (Fig 163 A, second lane). Immunogold labelling of Cpa with specific 

10 antiserum followed by transmission electron microscopy detected an abundance of Cpa at the cell 
surface and only occasional structures extending from the cell surface (Fig. 163 J). 

Four classes of FCT region can be discerned by the types and order of the genes 
contained within the region. The FCT region of strains of types M3, M5, Ml 8 and M49 have a 
similar organization whereas those of M6, Ml and Ml 2 differ. See Figure 164. As discussed 

15 below, these four FCT regions correlate to four GAS Adhesin Island types (AI-1, AI-2, AI-3 and 
AI-4). 

Applicants discovery of genes coding for pili in the FCT region of strain M6_ISS3650 
prompted them to examine the predicted surface exposed proteins in the variant FCT regions of 
three other GAS strains of having different M-type (M1_SF370, M5JSS4883 and 

20 Ml 2_200 10296) representing the other three FCT variants. Each gene present in the FCT region 
of each bacteria was cloned and expressed. Antisera specific for each recombinant protein was 
then used to probe mutanolysin extracts of the respective strains (6). hi Ml strain SF370, there 
are three predicted surface proteins (Cpa (also referred to as M1_JL26 and GAS 15), Ml_128 (a 
fimbrial protein also referred to as Spy0128 and GAS 16), and Ml_130 (also referred to as 

25 Spy0130 and GAS 18)) (GAS AI-2). Antisera specific for each surface protein reacted with a 
ladder of high molecular weight material (Fig. 163B). Immunogold staining of Ml strain SF370 
with antiserum specific for Ml_128 revealed pili structures similar to those seen when M6 strain 
ISS3650 was immunogold stained with antiserum specific for tee6 (See Fig 1 163K). Antisera 
specific for surface proteins Cpa and Ml_130 revealed abundant surface staining and occasional 

30 structures extending from the surface of Ml strain SF370 bacteria (Fig. 163S). 

The Ml__128 protein appears to be necessary for polymerization of Cpa and Ml_130 
proteins. If the Ml_128 gene in M1J3F370 was deleted, Western blot analysis using antibodies 
that hybridize to Cpa and Ml_130 no longer detected high molecular weight ladders comprising 
the Cpa and Ml_130 proteins (Fig. 163 E). See also Figures 177 A-C which provide the results 

35 of Western blot analysis of the Ml_128 (A128) deleted bacteria using anti-Ml_130 antiserum 

(Figure 177 A), anti-Ml_128 antiserum (Figure 177 B), and anti-Ml_126 antiserum (Figure 177 
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"indicative of pilus formation on the surface of Ml strain 



SF370, could not be detected by any of the three antisera in A128 bacteria. If the A128 bacteria 
were transformed with a plasmid containing the gene for Ml_128, Western blot analysis using 
antisera specific for Cpa and Ml_130 again detected high molecular weight ladders (Figure 163 



In agreement with the Western blot analysis, immunoelectron microscopy failed to detect 
pilus assembly on the A128 strain SF370 bacteria using Ml_128 antisera (Figure 178 B). 
Although A 128 SF370 bacteria were unable to form pili, Ml_126 (cpa) and Ml_130, which 
contain sortase substrate motifs, were present on the bacteria's surface. FACS analysis of the 

10 Ml_128 deleted (A128) strain SF370 bacteria also detected both Ml_126 and Ml_130 on the 
surface of the A128 strain SF370 bacteria. See Figure 179 D and F, which show a shift in 
fluorescence when antibodies immunoreactive to Ml_126 and Ml_130 are used on A128 
bacteria. As expected, virtually no shift in fluorescence is observed when antibodies 
immunoreactive to M1128 are used with the A128 bacteria (Figure 179 E). 

15 By contrast, deletion of the Ml 130 gene did not effect polymerization of Ml__128 (Fig. 

163 F). See also Figures 177 A-C, which provide Western blot analysis results of the Ml_130 
deleted (A130) strain SF370 bacteria using anti-Ml_130 (Figure 177 A), anti-Ml_128 (Figure 
177 B), and anti-Ml_126 antiserum (Figure 177 C). The anti-Ml__128 and anti-Ml_126 
antiserum both detected the presence of high molecular weight ladders in the A130 strain SF370 

20 bacteria, indicating that the A130 bacteria form pili that comprise Ml_126 and Ml_128 

polypeptides in the absence of Ml_130. As expected, the Western blot probed with antiserum 
immunoreactive with Ml_130 did not detect any proteins for the A130 bacteria(Figure 177A). 

Hence, the composition of the pili in GAS resembles that previously described for both C 
diphtheria (7, 8) and S. agalactiae (described above) (P) in that each pilus is formed by a 

25 backbone component which abundantly stains the pili in EM and is essential for the incorporation 
of the other components. 

Also similar to C. diphtheria, elimination of the srtCl gene from the FCT region of 
M1_SF370 abolished polymerization of all three proteins and assembly of pili (Fig. 163 G). See 
also Figures 177 A-C, which provide Western blot analysis of the SrtCl deleted (ASrtCl) strain 

30 SF370 bacteria using anti-Ml_J30 (Figure 177 A), anti-Ml_128 (Figure 177 B), and anti- 
M1J.26 antiserum (Figure 177 C). None of the three antisera immunoreacted with high 
molecular weight structures (pili) in the ASrtCl bacteria. Confirming that deletion of the SrtCl 
gene abrogates pilus assembly in strain SF370, immunoelectron microscopy using antisera 
against Ml_128 failed to detect pilus formation on the bacteria surface. See Figure 178 C. 

35 Although no assembled pili were detected on ASrtCl SF370, Ml_128 proteins could be detected 
on the surface of SF370. Thus, it appeared that SrtCl deletion prevented pilus assembly on the 
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cell wall. FACS analysis of the ASrtCl strain SF370 confirmed that deletion of SrtCl does not 
eliminate cell surface expression of M1126, Ml_128 or Ml__130. See Figure 179 G-I, which 
show a shift in fluorescence when antibodies immunoreactive to Ml__126 (Figure 179 G), 
5 Ml_128 (Figure 179 H) 5 and Ml JL30 (Figure 179 I) are used to detect cell surface protein 
expression on ASrtCl bacteria. Thus, SrtCl deletion prevents pilus formation, but not surface 
anchoring of proteins involved in pilus formation on the surface of bacteria. Another sortase is 
possibly involved in anchoring of the proteins to the bacteria surface. Pilus polymerization in C. 
diphtheriae is also dependent on particular sortase enzyme whose gene resides at the same 

10 genetic locus as the pilus components (7, 8). 

The LepA signal peptidase, Spy0127, also appears to be essential for pilus assembly in 
strain SF370. LepA deletion mutants (ALepA) of strain SF370 fail to assemble pili on the cell 
surface. Not only are the ALepA mutants unable to assemble pili, they are also deficient at cell 
surface Ml expression. See Figure 180, which provides a FACS analysis of the wildtype (A) and 

1 5 ALepA mutant (B) SF370 bacteria using Ml antisera. No shift in fluorescence is observed for the 
ALepA mutant bacteria in the presence of Ml immune serum. It is possible that these deletion 
mutants of LepA will be useful for detecting non-M, non-pili, surface exposed antigens on the 
surface of GAS, or any Gram positive bacteria. These antigens may also be useful in 
immunogenic compositions. 

20 Pili were also observed in M5 strain ISS4882 and M12 strain 20010296, The M5 strain 

ISS4882 contains genes for four predicted surface exposed proteins (GAS AI-3). Antisera 
against three of the four products of the FCT region (GAS AI-3) of M5_ISS4883 (Cpa, 
M5_orf80, M5_orf82) stained high molecular weight ladders in Western blot analysis (Figure 163 
C). Long pili were visible when antisera against M5_prf80 was used in immunogold staining 

25 followed by electron microscopy (Figure 1 63L). 

The M12 strain 20010296 contains genes for five predicted surface exposed proteins. 
(GAS AI-4) Antisera against three of the five products of the FCT region (GAS AI-4) of 
M12_20010296 (Cpa, EftLSLA, Orf2) stained high molecular weight ladders in Westen blot 
analysis (Figure 163 D). Long pili were visible when antisera against EftLSL.A were used (Fig. 

30 163 M). 

The major pilus forming proteins identified in the four strains studied by applicants (T6, 
Ml_128, M5_orf80 and EftLSL.A) share between 23% and 65% amino acid identity in any pairwise 
comparison, indicating that each pilus may represent a different Lancefield T-antigen. Each pilus is 
part of a trypsin resistant structure on the GAS bacteria surface, as is the case for the Lancefield T 
35 antigens. See Figure 165, which provides a FACS analysis of bacteria harboring each of the FCT 

types that had or had not been treated with trypsin (d). Following treatment, surface expression of the 
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specific for the pilus proteins, the bacteria's respective M proteins, or surface proteins not associated 
with the pili (Figure 165). Staining the cells with sera specific for proteins associated with the pili was 
not effected by trypsin treatment, whereas trypsin treatment substantially reduced detection of M- 
5 proteins or surface proteins not associated with pili. 

The pili structures identified on the surface of the GAS bacteria were confirmed to be 
Lancefield T antigens when commercially available T-serotyping sera detected the pili on the surface 
of bacteria. Western blot analysis was initially performed to determine if polyvalent serum pools 
(designated T, U, W, X, andY) could detect recombinant proteins for each of the major pilis 

10 components (T6, Ml_128, M5_orf80 and EftLSL.A) identified in the strains of bacteria discussed 
above. Pool U, which contains the T6 serum, recognized the T6 protein specifically (a surface 
exposed pilus protein from GAS AI-l)(Fig. 166 B). Pool T specifically recognized Ml_128 (a surface 
exposed pilus protein from GAS AI-2) (Fig. 166 A). Pool W recognized both M5_orf80 and 
EftLSL.A (Fig. 166 C). Using monovalent sera representative of each of the components of each 

15 polyvalent pool, applicants confirmed the specificity of the T6 antigen (corresponding to a surface 
exposed pilus protein from GAS AI-l)(Fig. 166 E) and identified Ml 128 as antigen Tl 
(corresponding to a surface exposed pilus protein from GAS AI-2) (Fig. 166 D), EftLSL.A as antigen 
T12 (corresponding to a surface exposed pilus protein from GAS AI-4) (Fig. 166 G) and M5_orf80 as 
a common antigen recognized by the related sera T5, T27 and T44 (corresponding to a surface 

20 exposed pilus protein from GAS AI-3). 

Confirming applicants observations, discussed above, that deleting the Ml__128 gene from 
M1_SF370 abolishes pilus formation, the pool T sera stained whole Ml JSF370 bacteria (Fig. 166 H) 
but failed to stain M1_SF370 bacteria lacking the M1128 gene (Fig. 166 I). 

As discussed above, Applicants have identified at least four different Group A Streptococcus 

25 Adhesin Islands. While these GAS AI sequences can be identified in numerous M types, Applicants 
have surprisingly discovered a correlation between the four main pilus subunits from the four 
different GAS AI types and specific T classifications. While other trypsin-resistant surface exposed 
proteins are likely also implicated in the T classification designations, the discovery of the role of the 
GAS adhesin islands (and the associated hyper-oligomeric pilus like structures) in T classification and 

30 GAS serotype variance has important implications for prevention and treatment of GAS infections. 
Applicants have identified protein components within each of the GAS adhesin islands which are 
associated with the pilus formation. These proteins are believed to be involved in the bacteria's initial 
adherence mechanisms. Immunological recognition of these proteins may allow the host immune 
response to slow or prevent the bacteria's transition into the more pathogenic later stages of infection. 

35 In addition, the GAS pili may be involved in formation of biofilms. Applicants have discovered that 

the GBS pili structures appear to be implicated in the formation of biofilms (populations of bacteria 

growing on a surface, often enclosed in an exopolysaccharide matrix). Biofilms are generally 

associated with bacterial resistance, as antibiotic treatments and host immune response are frequently 
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response against surface proteins exposed during the first steps of bacterial attachment (i.e., before 
complete biofilm formation) is preferable. 

The invention therefore provides for improved immunogenic compositions against GAS 
5 infection which may target GAS bacteria during their initial attachment efforts to the host epithelial 
cells and may provide protection against a wide range of GAS serotypes. The immunogenic 
compositions of the invention include GAS AI surface proteins which may be formulated in an 
oligomeric, or hyperoligomeric (pilus) form. The invention also includes combinations of GAS AI 
surface proteins. Combinations of GAS AI surface proteins may be selected from the same adhesin 
10 island or they may be selected from different GAS adhesin islands. 

The invention comprises compositions comprising a first GAS AI protein and a second GAS 
AI protein wherein the first and second GAS AI proteins are derived from different GAS adhesin 
islands. For example, the invention includes a composition comprising at least two GAS AI proteins 
wherein the GAS AI proteins are encoded by the adhesin islands selected from the group consisting of 
15 GAS AI-1 and AI-2; GAS AI-1 and GAS AI-3; GAS AI-1 and GAS AI-4; GAS AI-2 and GAS AI-3; 
GAS AI-2 and GAS AI-4; and GAS AI-3 and GAS AI-4. Preferably the two GAS AI proteins are 
derived from different T-types. 

A schematic arrangement of GAS Adhesin Island sequences is set forth in FIGURE 162. In 
all strains, the AI region is flanked by the highly conserved open reading frames Ml_123 and Ml- 
20 136. Between three and five genes in each locus code for surface proteins containing LPXTG motifs. 
These surface proteins also all belong to the family of genes coding for ECM binding adhesins. 

Adhesin island sequences can be identified in numerous M types of Group A Streptococcus. 
Examples of AI sequences within Ml, M6, M3, M5, Ml 2, M18, and M49 serotypes are discussed 
below. 

25 GAS Adhesin Islands generally include a series of open reading frames within a GAS genome 

that encode for a collection of surface proteins and sortases. A GAS Adhesin Island may encode for 
amino acid sequences comprising at least one surface protein. Alternatively, a GAS Adhesin Island 
may encode for at least two surface proteins and at least one sortase. Preferably, a GAS Adhesin 
Island encodes for at least three surface proteins and at least two sortases. One or more of the surface 
30 proteins may include an LPXTG motif (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate 
motif. One or more GAS AI surface proteins may participate in the formation of a pilus structure on 
the surface of the Gram positive bacteria. 

GAS Adhesin Islands of the invention preferably include a divergently transcribed 
transcriptional regulator. The transcriptional regulator may regulate the expression of the GAS AI 
35 operon. Examples of transcriptional regulators found in GAS AI sequences include RofA and Nra. 

The GAS AI surface proteins may bind or otherwise adhere to fibrinogen, fibronectin, or 
■ collagen. One or more of the GAS AI surface proteins may comprise a fimbrial structural subunit. 
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One or more or the GAS AI surface proteins may include an LPXTG motif or other sortase 
substrate motif. The LPXTG motif may be followed by a hydrophobic region and a charged C 
terminus, which are thought to retard the protein in the cell membrane to facilitate recognition by the 
membrane-localized sortase. See Barnett, et aL ? J. Bacteriology (2004) 186 (17): 5865-5875. 
5 GAS AI sequences may be generally categorized as Type 1, Type 2, Type 3, or Type 4, 

depending on the number and type of sortase sequences within the island and the percentage identity 
of other proteins (with the exception of RofA and cpa) within the island. Figure 167 provides a chart 
indicating the number and type of sortase sequences identified within the adhesin islands of various 
strains and serotypes of GAS. As can be seen in this figure, all GAS strains and serotypes thus far 

10 characterized as an AI-1 have a SrtB type sortase, all GAS strains and serotypes thus far characterized 
as an AI-2 have SrtB and SrtCl type sortases, all GAS strains and serotypes thus far characterized as 
an AI-3 have a SrtC2 type sortase, and all GAS strains and serotypes thus far characterized as an AI-4 
have SrtB and SrtC2 type sortases. A comparison of the percentage identity of sequences within the 
adhesin islands was presented in Table 45, see above. 

15 (1) Adhesin Island sequence within M6: GAS Adhesin Island 1 ("GAS AI-1") 

A GAS Adhesin Island within M6 serotype (MG AS 10394) is outlined in Table 4 below. This 
GAS adhesin island 1 ("GAS AI-1") comprises surface proteins, a srtB sortase and a rofA divergently 
transcribed transcriptional regulator. 

GAS AI-1 surface proteins include Spy0157 (a fibronectin binding protein), Spy0159 (a 

20 collagen adhesion protein) and Spy0160 (a fimbrial structural subunit). Preferably, each of these 

GAS AI-1 surface proteins includes an LPXTG sortase substrate motif, such as LPXTG (SEQ ID NO: 
122) or LPXSG (SEQ ID NO: 134) (conservative replacement of threonine with serine). 

GAS AI-1 includes a srtB type sortase. GAS srtB sortases may preferably anchor surface 
proteins with an LPSTG motif (SEQ ID NO: 166), particularly where the motif is followed by a 

25 serine. 



Table 4: GAS AI-1 sequences from M6 isolate (MGAS10394) 



AI-1 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


functional description 


M6_Spy0156 




Transcriptional regulator {rofA) 


M6_Spy0157 


LPXTG 


Fibronectin-binding protein 


M6_Spy0158 




Reverse transcriptase 


M6_Spy01 59 


LPXSG 


Collagen adhesion protein 


M6_Spy0160 


LPXTG 


Fimbrial structural subunit 


M6_JSpy0161 


srtB 


Sortase 



M6_Spy0160 appears to be present on the surface of GAS as part of oligomeric (pilus) 
structures. Figures 127-132 present electron micrographs of GAS serotype M6 strain 3650 
30 immunogold stained for M6_Spy0160 using anti-M6_Spy0160 antiserum. Oligomeric or 
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hy^eroligmeric siirac^ particles can be seen extending from the surface of the 

GAS in each of these figures, indicating the presence of multiple M6_Spy0160 polypeptides in the 
oligomeric or hyperoligomeric structures. Figure 176 A-F present electron micrographs of GAS M6 
strain 2724 immunogold stained for M6__Spy0160 using anti-M6_Spy0160 antiserum (Figures 176 A- 
5 E) or immunogold stained for M6_Spy0159 using anti-M6_Spy0159 antiserum (Figure 176 F). 
Oligomeric or hyperoligomeric structures labelled with gold particles can again be seen extending 
from the surface of the M6 strain 2724 GAS bacteria immunogold stained for M6_Spy0160. 
M6_Spy0159 is also detected on the surface of the M6 strain 2724 GAS. 

FACS analysis has confirmed that the GAS AI-1 surface proteins spyM6_0159 and 

10 spyM6_0160 are indeed expressed on the surface of GAS. Figure 73 provides the results of FACS 
analysis for surface expression of spyM6__0159 on each of GAS serotypes M6 2724, M6 3650, and 
M6 2894. A shift in fluorescence is observed for each GAS serotype when anti-spyM6_0159 
antiserum is present, demonstrating cell surface expression. Table 18, below, quantitatively 
summarizes the FACS fluorescence values obtained for each GAS serotype in the presence of pre- 

15 immune antiserum, anti-spyM6_0 159 antiserum, and the difference in fluorescence value between the 
pre-immune and anti-spyM6_0159 antiserum. 



Table 18; Summary of FACS values for surface expression of spyM6 0159 



2724 


3650 


2894 


Pre- 
immune 


Anti- 

spyM6 0159 


Change 


Pre- 
immune 


Anti- 

spyM6_0159 


Change 


Pre- 
immune 


Anti- 

spyM6J)159 


Change 


134.84 


427.48 


293 


149.68 


712.62 


563 


193.86 


597.8 


404 



Figure 74 provides the results of FACS analysis for surface expression of spyM6_0160 on 
each of GAS serotypes M6 2724, M6 3650, and M6 2894. In the presence of of anti-spyM6J) 1 60 



20 antiserum, a shift in fluorescence is observed for each GAS serotype, which demonstrates its cell 
surface expression. Table 19, below, quantitatively summarizes the FACS fluorescence values 
obtained for each GAS serotype in the presence of pre-immune antiserum, anti-spyM6_0160 
antiserum, and the change in fluorescence value between the pre-immune and anti-spyM6_0160 
antiserum. 

25 Table 19: Summary of FACS values for surface expression of spyM6_0160 



2724 


3650 


2894 


Pre- 
immune 


Anti- 

spyM6 0160 


change 


Pre- 
immune 


Anti- 

spyM6_0160 


change 


Pre- 
immune 


Anti- 

spyM6 0160 


change 


117.12 


443.24 


326 


128.57 


776.39 


648 


125.87 


621.17 


495 



Surface expression of M6_Spy0159 and M6_Spy0160 on M6 serotype GAS has also been 
confirmed by Western blot analysis. Figure 98 shows that while pre-immune sera (P ot-0159) does 
not detect expression of M6_Spy0159 in GAS serotype M6, anti-M6_Spy0159 immune sera (I ot- 
0159) is able to detect M6_Spy0159 protein in both total GAS M6 extracts (M6 tot) and GAS M6 



30 fractions enriched for cell surface proteins (M6 surf prot). The M6_Spy0159 proteins detected in the 
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'kroAS' , Mo extracts or the'GAS'MS extracts enriched for surface proteins are also present as high 
molecular weight structures, indicating that M6_Spy0159 may be in an oligomeric (pilus) form. 

Figure 112 shows that while preimmune sera (Preimmune Anti 106) does not detect 
expression of M6JSpy0160 in GAS serotype M6 strain 2724, anti-M6_SpyO 1 60 immune sera (Anti 
5 160) does in both total GAS M6 strain 2724 extracts (M6 2724 tot) and GAS M6 strain 2724 fractions 
enriched for surface proteins. The M6_Spy0160 proteins detected in the total GAS M6 strain 2724 
extracts or the GAS M6 strain 2724 extracts enriched for surface proteins are also present as high 
molecular weight structures, indicating that M6_Spy0160 may be in an oligomeric (pilus) form. 

Figures 110 and 111 both further verify the presence of M6_Spy0159 and M6_Spy0160 in 
10 higher molecular weight structures on the surface of GAS. Figure 110 provides a Western blot 
performed to detect M6_Spy0159 and M6_Spy0160 in GAS M6 strain 2724 extracts enriched for 
surface proteins. Antiserum raised against either M6_Spy0159 (Anti-159) or M6_Spy0160 (Anti- 
160) cross-hybridizes with high molecular weight structures (pili) in these extracts. Figure 1 1 1 
provides a similar Western blot that verifies the presence of M6_ Spy0159 and M6_Spy0160 in high 
15 molecular weight structures in GAS M6 strain 3650 extracts enriched for surface proteins. 

SpyM6_0157 (a fibronectin-binding protein) may also be expressed on the surface of GAS 
serotype M6 bacteria. Figure 174 shows the results of FACS analysis for surface expression of 
spyM6_0157 on M6 strain 3650. A slight shift in fluorescence is observed, which demonstrates that 
some spyM6 0157 may be expressed on the GAS cell surface. 
20 Adhesin Island sequence within M6: GAS Adhesin Island 2 ("GAS AI-2") 

A GAS Adhesin Island within Ml serotype (SF370) is outlined in Table 5 below. This GAS 
adhesin island 2 ("GAS AI-2") comprises surface proteins, a SrtB sortase, a SrtCl sortase and a RofA 
divergently transcribed transcriptional regulator. 

GAS AI-2 surface proteins include GAS 15 (Cpa), Spy0128 (thought to be a fimbrial protein) 
25 and Spy0130 (a hypothetical protein). Preferably, each of these GAS AI-2 surface proteins includes 
an LPXTG sortase substrate motif, such as LPXTG (SEQ ID NO: 122), VVXTG (SEQ ID NO: 135), 
or EVXTG (SEQ ID NO: 136). 

GAS AI-2 includes a srtB type sortase and a srtCl sortase. As discussed above, GAS SrtB 
sortases may preferably anchor surface proteins with an LPSTG (SEQ ID NO: 166) motif, particularly 
30 where the motif is followed by a serine. GAS SrtCl sortase may preferentially anchor surface 

proteins with a V(P/V)PTG (SEQ ID NO: 167) motif GAS SrtCl may be differentially regulated by 
RofA. 



GAS AI-2 may also include a LepA putative signal peptidase I protein. 
Table 5 : GAS AI-2 sequence from Ml isolate (SF370) 



AI-2 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


functional description 


SPy0124 




rofA regulatory protein 


GAS15(not annotated in SF370) 


WXTG 


cpa 
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3 *3t 


LepA putative signal peptidase I 


SPy0128 (GAS 16) 


EVXTG 


hypothetical protein (fimbrial) 


SPy0129 (GAS 17) 


srtCl 


sortase 


SPy0130(GAS18) 


LPXTG 


hypothetical protein 


SPy0131 




conserved hypothetical protein 


SPy0133 




conserved hypothetical protein 


SPy0135 (GAS20) 


srtB 


sortase (putative fimbrial- 
associated protein) 



GAS 15, GAS 16, and GAS 18 appear to be present on the surface of GAS as part of 
oligomeric (pilus) structures. Figures 113-115 present electron micrographs of GAS serotype Ml 
strain SF370 immunogold stained for GAS 15 using anti-GAS 15 antiserum. Figures 116-121 provide 
electron micrographs of GAS serotype Ml strain SF370 immunogold stained for GAS 16 using anti- 



5 GAS 16 antiserum. Figures 122-125 present electron micrograph of GAS serotype Ml strain SF370 
immunogold stained for GAS 18 using anti-GAS 18 antiserum. Oligomers of these proteins can be 
seen on the surface of SF370 bacteria in the immuno-gold stained micrographs. 

Figure 126 reveals a hyperoligomer on the surface of a GAS serotype Ml strain SF370 
bacterium immunogold stained for GAS 18. This long hyperoliogmeric structure comprising GAS 18 
10 stretches far out into the supernatant from the surface of the bacteria. 

FACS analysis has confirmed that the GAS AI-2 surface proteins GAS 15, GAS 16, and GAS 
18 are expressed on the surface of GAS. Figure 75 provides the results of FACS analysis for surface 
expression of GAS 15 on each of GAS serotypes Ml 2719, Ml 2580, Ml 3280, Ml SF370, Ml 2913, 
and Ml 3348. A shift in fluorescence is observed for each GAS serotype when anti-GAS 15 
15 antiserum is present, demonstrating cell surface expression. Table 20, below, quantitatively 

summarizes the FACS fluorescence values obtained for each GAS serotype in the presence of pre- 
immune antiserum, anti-GAS 15 antiserum, and the difference in fluorescence value between the pre- 
immune and anti-GAS 15 antiserum. 



Table 20: Summary of FACS values for surface expression of GAS 15 



2719 


2580 


3280 


Pre- 
immune 


Anti-GAS 
15 


Change 


Pre- 

immune 


Anti-GAS 
15 


Change 


Pre- 
immune 


Anti-GAS 
15 


Change 


159.46 


712.71 


553 


123,9 


682.84 


559 


217.02 


639.69 


423 




SF370 


2913 


3348 


Pre- 
immune 


Anti-GAS 
15 


Change 


Pre- 

immune 


Anti-GAS 
15 


Change 


Pre- 
immune 


Anti-GAS 
15 


Change 


201.93 


722.68 


521 


121.41 


600.45 


479 


152.09 


446.41 


294 



20 Figures 76 and 79 provide the results of FACS analysis for surface expression of GAS 16 on 

each of GAS serotypes Ml 2719, Ml 2580, Ml 3280, Ml SF370, Ml 2913, and Ml 3348. The 
FACS data in Figure 76 was obtained using antisera was raised against full length GAS 16. In the 
presence of this anti-GAS 16 antiserum, a shift in fluorescence is observed for each GAS serotype, 
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dffila^ 21, below, quantitatively summarizes the FACS 

fluorescence values obtained for each GAS serotype in the presence of pre-immune antiserum, anti- 
GAS 16 antiserum, and the change in fluorescence value between the pre-immune and anti-GAS 16 
antiserum. 

5 Table 2 1 : Summary of FAC S values for surface expression of GAS 1 6 



2719 


2580 


3280 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


233.27 


690.09 


457 


133.82 


732.29 


598 


264.47 


649.43 


385 




SF370 


2913 


3348 


Pre- 
immune 


Anti-GA'S 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


237.2 


727.46 


490 


138.52 


588.04 


450 


180.56 


420.93 


240 



The FACS data in Figure 79 was obtained using antisera was raised against a truncated GAS 



16, which is encoded by SEQ ID NO: 179, shown below. 
SEQ ID NO: 179: 

GCTACAACAGTTCACGGGGAGACTGTTGTAAACGGAGCCAAACTAACAGTTACAAAAAACCTTGATTTAGTTAAT 
10 AGCAATGCATTAATTCCAAATACAGATTTTACATTTAAAATCGAACCTGATACTACTGTCAACGAAGACGGAAAT 
AAGTTTAAAGGTGTAGCTTTGAACACACCGATGACTAAAGTCACTTACACCAATTCAGATAAAGGTGGATCAAAT 
ACGAAAACTGCAGAATTTGATTTTTCAGAAGTTACTTTTGAAAAACCAGGTGTTTATTATTACAAAGTAACTGAG 
GAGAAGATAGATAAAGTTCCTGGTGTTTCTTATGATACAACATCTTACACTGTTCAAGTTCATGTCTTGTGGAAT 
GAAGAGCAACAAAAACCAGTAGCTACTTATATTGTTGGTTATAAAGAAGGTAGTAAGGTGCCAATTCAGTTCAAA 
15 AATAGCTTAGATTCTACTACATTAACGGTGAAGAAAAAAGTTTCAGGTACCGGTGGAGATCGCTCTAAAGATTTT 
AATTTTGGTCTGACTTTAAAAGCAAATCAGTATTATAAGGCGTCAGAAAAAGTCATGATTGAGAAGACAACTAAA 
GGTGGTCAAGCTCCTGTTCAAACAGAGGCTAGTATAGATCAACTCTATCATTTTACCTTGAAAGATGGTGAATCA 
AT C AAAG T C AC A AAT C T T C C AG TAGGTGTGGATTATGTTGTCACT G AAG AC GAT TAG A A AT C AG AAA A AT AT A C A 
ACCAACGTGGAAGTTAGTCCTCAAGATGGAGCTGTAAAAAATATCGCAGGTAATTCAACTGAACAAGAGACATCT 
20 ACT GAT AAAG AT AT G ACC ATT AC T T T TAG A AAT AAA A AAG AT T T 

In the presence of this anti-GAS 16 antiserum, a shift in fluorescence is observed for each GAS 
serotype, demonstrating its cell surface expression. Table 22, below, quantitatively summarizes the 
FACS fluorescence values obtained for each GAS serotype in the presence of pre-immune antiserum, 
anti-GAS 16 antiserum, and the change in fluorescence value between the pre-immune and anti-GAS 
25 16 antiserum. 



Table 22: Summary of FACS values for surface expression of GAS 16 using a second antisera 



2719 


2580 


3280 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


141.55 


650.22 


509 


119.57 


672.35 


553 


209.18 


666.71 


458 




SF370 


2913 


3348 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


Pre- 
immune 


Anti-GAS 
16 


Change 


159.92 


719.32 


559 


115.97 


585.9 


470 


146.1 


414.01 


268 
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P C "figuiil Ql?8'piiviiyiiii^&lt^ of FACS analysis for surface expression of GAS 18 on 
each of GAS serotypes Ml 2719, Ml 2580, Ml 3280, Ml SF370, Ml 2913, and Ml 3348. The 
antiserum used to obtain the FACS data in each of Figures 77 and 78 was different, although each was 
raised against full length GAS 18. In the presence of each of the anti-GAS 18 antisera, a shift in 
5 fluorescence is observed for each GAS serotype, demonstrating its cell surface expression. Tables 23 
and 24, below, quantitatively summarizes the FACS fluorescence values obtained for each GAS 
serotype in the presence of pre-immune antiserum, first or second anti-GAS 18 antiserum, and the 
change in fluorescence value between the pre-immune and first or second anti-GAS 1 8 antiserum. 



Table 23 : Summary of FACS values for surface expression of GAS 1 8 



2719 


2580 


3280 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


135.68 


327.98 


192 


116.32 


379.41 


263 


208.12 


380.84 


173 




SF370 


2913 


3348 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


185.39 


438.23 


253 


119.95 


373.32 


253 


147.12 


266.51 


119 


Table 24: Summary of FACS values for surface expression of GAS 18 using a second antisera 


2719 


2580 


3280 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


150.4 


250.39 


100 


139.18 


386.38 


247 


253.38 


347.72 


94 




SF370 


2913 


3348 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


Pre- 
immune 


Anti-GAS 
18 


Change 


188.64 


373.11 


184 


124.94 


384.82 


260 


168.8 


213.65 


45 



Surface expression of GAS 15, GAS 16, and GAS 18 on Ml serotype GAS has also been 
confirmed by Western blot analysis. Figure 89 shows that while pre-immune sera does not detect 
GAS Ml expression of GAS 15, anti-GAS 15 immune sera is able to detect GAS 15 protein in both 
total GAS Ml extracts and GAS Ml proteins enriched for cell surface proteins. The GAS 15 proteins 



15 detected in the Ml extracts enriched for surface proteins are also present as high molecular weight 
structures, indicating that GAS 15 may be in an oligomeric (pilus) form. Figure 90 also shows the 
results of Western blot analysis of Ml serotype GAS using anti-GAS 15 antisera. Again, the lanes 
that contain GAS Ml extracts enriched for surface proteins (Ml prot sup) show the presence of high 
molecular weight structures that may be oligomers of GAS 15. Figure 91 provides an additional 

20 Western blot identical to that of Figure 90, but that was probed with pre-immune sera. As expected, 
no proteins were detected on this membrane. 

Figure 92 provides a Western blot that was probed for GAS 16 protein. While pre-immune 
sera does not detect GAS Ml expression of GAS 16, anti-GAS 16 immune sera is able to detect GAS 

-112- 



WO 2006/078318 PCT/US2005/027239 

ll pKteln''m yM'S 55 ltfi" : ikrafi§ enrfeffik^for cell surface proteins. The GAS 16 proteins detected in 
the Ml extracts enriched for surface proteins are present as high molecular weight structures, 
indicating that GAS 16 may be in an oligomeric (pilus) form. Figure 93 also shows the results of 
Western blot analysis of Ml serotype GAS using anti-GAS 16 antisera. The lanes that contain total 
5 GAS Ml protein (Ml tot new and Ml tot old) and the lane that contains GAS Ml extracts enriched 
for surface proteins (Ml prot sup) show the presence of high molecular weight structures that may be 
oligomers of GAS 16. Figure 94 provides an additional Western blot identical to that of Figure 93, 
but that was probed with pre-immune sera. As expected, no proteins were detected on this membrane. 
Figure 95 provides a Western blot that was probed for GAS 1 8 protein. While pre-immune 

10 sera does not detect GAS Ml expression of GAS 18, anti-GAS 18 immune sera is able to detect GAS 
18 protein in GAS Ml extracts enriched for cell surface proteins. The GAS 18 proteins detected in 
the Ml extracts enriched for surface proteins are present as high molecular weight structures, 
indicating that GAS 18 may be in an oligomeric (pilus) form. Figure 96 also shows the results of 
Western blot analysis of Ml serotype GAS using anti-GAS 18 antisera. The lane that contains GAS 

15 Ml extracts enriched for surface proteins (Ml prot sup) show the presence of high molecular weight 
structures that may be oligomers of GAS 18. Figure 97 provides an additional Western blot identical 
to that of Figure 96, but that was probed with pre-immune sera. As expected, no proteins were 
detected on this membrane. 

Figures 102-106 provide additional Western blots to verify the presence of GAS 15, GAS 16, 

20 and GAS 18 in high molecular weight structures in GAS. Each Western blot was performed using 
proteins from a different GAS Ml strain, 2580, 2913, 3280, 3348, and 2719. Each Western blot was 
probed with antisera raised against each of GAS 15, GAS 16, and GAS 18. As can be seen in Figures 
102-106, none of the Western blots shows detection of proteins using pre-immune serum (Pct-158, 
Pa-15, Pa- 16, or Pa-1 8), while each Western blot shows cross-hybridization of the GAS 15 (la- 15), 

25 GAS 16 (la- 16), and GAS 18 (la- 18) antisera to high molecular weight structures. Thus, these 
Western blots confirm that GAS 15, GAS 16, and GAS 18 can be present in pili in GAS Mi- 
Figure 107 provides a similar Western blot performed to detect GAS 15, GAS 16, and GAS 
18 proteins in a GAS serotype Ml strain SF370 protein fraction enriched for surface proteins. This 
Western blot also shows detection of GAS 15 (Anti-15), GAS 16 (Anti-16), and GAS 18 (Anti-18) as 

30 high molecular weight structures. 

(3) Adhesin Island sequence within M3. MS. and Ml 8: GAS Adhesin Island 3 ("GAS AI-3") 
GAS Adhesin Island sequences within M3, M5, and M18 serotypes are outlined in Tables 6 — 
8 and 10 below. This GAS adhesin island 3 ("GAS AI-3") comprises surface proteins, a SrtC2 
sortase, and a Negative transcriptional regulator (Nra) divergently transcribed transcriptional 
35 regulator. 

GAS AI-3 surface proteins within include a collagen binding protein, a fimbrial protein, a F2 
like fibronectin-binding protein. GAS AI-3 surface proteins may also include a hypothetical surface 1 
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pJKfc'l^flriS^J^^f surface proteins include an LPXTG sortase substrate 

motif, such as LPXTG (SEQ ID NO: 122), VPXTG (SEQ ID NO: 137), QVXTG (SEQ ID NO: 138) 
or LPXAG (SEQ ID NO: 139). 

GAS AI-3 includes a SrtC2 type sortase. GAS SrtC2 type sortases may preferably anchor 
5 surface proteins with a QVPTG (SEQ ID NO: 140) motif, particularly when the motif is followed by a 
hydrophobic region and a charged C terminus tail. GAS SrtC2 may be differentially regulated by 
Nra. 

GAS AI-3 may also include a LepA putative signal peptidase I protein. 
GAS AI-3 may also include a putative multiple sugar metabolism regulator. 
10 Table 6: GAS AI-3 sequences from M3 isolate (MGAS315) 



AI-3 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


Functional description 


SpyM3_0097 




Negative transcriptional regulator (Nra) 


SpyM3_0098 


VPXTG 


putative collagen binding protein (Cpb) 


SpyM3J)099 




LepA putative signal peptidase I 


SpyM3J)100 


QVXTG 


conserved hypothetical protein (fimbrial) 


SpyM3_0101 


SrtC2 


sortase 


SpyM3J)102 


LPXAG 


hypothetical protein 


SpyM3_0103 




putative multiple sugar metabolism regulator 


SpyM3_0104 


LPXTG 


protein F2 like fibronectin-binding protein 



AI-3 sequence 
identifier 


Sortase Substrate 
sequence or sortase 
type 


Functional description 


SPs0099 




Negative transcriptional regulator (Nra) 


SPsOlOO 


VPXTG 


putative collagen binding protein (Cpb) 


SPsOlOl 




LepA putative signal peptidase I 


SPs0102 


QVXTG 


conserved hypothetical protein (fimbrial) 


SPs0103 


SrtC2 ■ 


sortase 


SPs0104 


LPXAG 


hypothetical protein 


SPs0105 




putative multiple sugar metabolism regulator 


SPs0106 


LPXTG 


protein F2 like fibronectin-binding protein 



AI-3 sequence 
identifier 


Sortase substrate 
sequence or 
sortase type 


Functional description 


orf77 




Negative transcriptional regulator (Nra) 


orf78 


VPXTG 


putative collagen binding protein (Cpb) 


orf79 




LepA putative signal peptidase I 


orf80 


QVXTG 


conserved hypothetical protein (fimbrial) 


orfSl 


SrtC2 


sortase 
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hypothetical protein 


orf83 




putative multiple sugar metabolism regulator 


orf84 


LPXTG 


protein F2 like fibronectin-binding protein 



Table 8: GAS AI-3 sequences from M18 isolate (MGAS8232) 



AL-3 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


Functional description 


spyM18J)125 




Negative transcriptional regulator (Nra) 
(N-terminal fragment) 


spyM18_0126 


VPXTG 


putative collagen binding protein (Cpb) 


spyM18_0127 




LepA putative signal peptidase I 


spyM18_0128 


QVXTG 


conserved hypothetical protein (fimbrial) 


spyM18_0129 


SrtC2 


sortase 


spyM18J)130 


LPXAG 


hypothetical protein 


spyM18_0131 




putative multiple sugar metabolism regulator 


spyM18_0132 


LPXTG 


protein F2 like fibronectin-binding protein 



5 Table 44: GAS AI-3 sequences from M49 isolate (591) 



AI-3 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


Functional description 


SpyoM01000156 




Negative transcriptional regulator (Nra) 


SpyoMO 1000 155 


VPXTG 


collagen binding protein (Cpa) 


SpyoMO 1000 154 




LepA putative signal peptidase I 


SpyoM01000153 


QVXTG 


conserved hypothetical protein (fimbrial) 


SpyoM01000152 


SrtC2 


sortase 


SpyoM01000151 


LPXAG 


hypothetical protein 


SpyoMO 1000 150 




MsmRL 


SpyoMO 1000 149 


LPXTG 


protein F2 like fibronectin-binding protein 



A schematic of AI-3 serotypes M3, M5, M18, and M49 is shown in Figure 51A. Each 
contains an open reading frame encoding a SrtC2-type sortase of nearly identical amino acid 
1 10 sequence. See Figure 52B for an amino acid sequence alignment for each of the SrtC2 amino acid 
sequences. 

The protein F2-like fibronectin-binding protein of each these type 3 adhesin islands contains a 
pilin motif and an E-box. Figure 60 indicates the amino acid sequence of the pilin motif and E-box of 
each of GAS AI-3 serotype M3 MGAS315 (SpyM3_0 104/2 1909640), GAS AI-3 serotype M3 SSI 
15 (Sps0106/28895018), GAS AI-3 serotype M18 (SpyM 18J> 13 21 19 745 3 07), and GAS AI-3 serotype 
M5 (orf84). 

FACS analysis has confirmed that the GAS AI-3 surface proteins SpyM3_0098, 
SpyM3_0100, SpyM3_0102, and SpyM3_0104 are expressed on the surface of GAS. Figure 80 
provides the results of FACS analysis for surface expression of SpyM3_0098 on each of GAS 
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sSU^pefmWSi QJ5rf$ S3?! 3 ilttfftn fluorescence is observed for each GAS serotype when 
anti-SpyM3_0098 antiserum is present, demonstrating cell surface expression. Table 25, below, 
quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype in the 
presence of pre-immune antiserum, anti-SpyM3_0098 antiserum, and the difference in fluorescence 
5 value between the pre-immune and anti-SpyM3_0098 antiserum. 



Table 25: Summary of FACS values for surface expression of SpyM3_0098 



2721 


3135 


Pre-immune 


Anti- 

spyM3 0098 


Change 


Pre-immune 


Anti- 

spyM3 0098 


Change 


117.85 


249.51 


132 


99.17 


277.21 


178 



Figure 81 provides the results of FACS analysis for surface expression of SpyM3_0100 on 
each of GAS serotypes M3 2721 and M3 3 135. A shift in fluorescence is observed for each GAS 
serotype when anti-SpyM3_0100 antiserum is present, demonstrating cell surface expression. Table 
10 26, below, quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype 
in the presence of pre-immune antiserum, anti-SpyM3_0100 antiserum, and the difference in 
fluorescence value between the pre-immune and anti-SpyM3_0100 antiserum. 



Table 26: Summary of FACS values for surface expression of SpyM3_0100 



2721 


3135 


Pre-immune 


Anti- 

spyM3 0100 


Change 


Pre-immune 


Anti- 

spyM3 0100 


Change 


110.31 


181.91 


72 


97.87 


250.01 


152 



Figure 82 provides the results of FACS analysis for surface expression of SpyM3_0102 on 
15 each of GAS serotypes M3 272 1 and M3 3 135. A shift in fluorescence is observed for each GAS 

serotype when anti-SpyM3_0102 antiserum is present, demonstrating cell surface expression. Table 
27, below, quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype 
in the presence of pre-immune antiserum, anti-SpyM3 0102 antiserum, and the difference in 
fluorescence value between the pre-immune and anti-SpyM3_0 1 02 antiserum. 
20 Table 27: Summary of FACS values for surface expression of SpyM3J3102 in M3 serotypes 



2721 


3135 


Pre-immune 


Anti- 

>spyM3_0102 


Change 


Pre-immune 


Anti- 

spyM3 0102 


Change 


109.86 


155.26 


45 


100.02 


112.58 


13 



Figure 82 also provides the results of FACS analysis for surface expression of apilin antigen 
that has homology to SpyM3_0102 identified in a different GAS serotype, M6. FACS analysis 
conducted with the SpyM3_0102 antisera was able to detect surface expression of the homologous 
SpyM3J)102 antigen on each of GAS serotypes M6 2724, M6 3650, and M6 2894. Table 28, below, 
25 quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype in the 

presence of pre-immune antiserum, anti-SpyM3_0102 antiserum, and the difference in fluorescence 
value between the pre-immune and anti-SpyM3_0102 antiserum. 

Table 28: Summary of FACS values for surface expression of SpyM3_0102 in M6 serotypes 
| 2724 | 3650 1 2894 
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immune 


^JgHS 0:i: 

spyM3 0102 


i ./ Ei ^ 

Change 


| 

immune 


Anti- 

spyM3 0102 


Change 


Pre- 

immune 


Anti- 

spyM3 0102 


Change 


146.59 


254.03 


107 


162.56 


294.03 


131 


175.49 


313.69 


138 



SpyM3_0102 is also homologous to pilin antigen 19224139 of GAS serotype M12. Antisera 
raised against SpyM3_0102 is able to detect high molecular weight structures in GAS serotype M12 
strain 2728 protein fractions enriched for surface proteins, which would contain the 19224139 
antigen. See Figure 109 at the lane labelled Ml 2 2728 surf prot. 



5 Figure 83 provides the results of FACS analysis for surface expression of SpyM3_0104 on 

each of GAS serotypes M3 2721 and M3 3 135. A shift in fluorescence is observed for each GAS 
serotype when anti-SpyM3_0104 antiserum is present, demonstrating cell surface expression. Table 
29, below, quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype 
in the presence of pre-immune antiserum, anti-SpyM3__0104 antiserum, and the difference in 
10 fluorescence value between the pre-immune and anti-SpyM3_0104 antiserum. 



Table 29: Summary of FACS values for surface expression of SpyM3_0104 in M3 serotypes 



2721 


3135 


Pre-immune 


Anti- 

spyM3 0104 


Change 


Pre-immune 


Anti- 

spyM3 0104 


Change 


128.45 


351.65 


223 


105.1 


339.88 


235 



Figure 83 also provides the results of FACS analysis for surface expression of a pilin antigen 
that has homology to SpyM3_0104 identified in a different GAS serotype, M12. FACS analysis 
conducted with the SpyM3_0104 antisera was able to detect surface expression of the homologous 
15 SpyM3_0104 antigen on GAS serotype M12 2728. Table 30, below, quantitatively summarizes the 
FACS fluorescence values obtained for this GAS serotype in the presence of pre-immune antiserum, 
anti-SpyM3_0104 antiserum, and the difference in fluorescence value between the pre-immune and 
anti-SpyM3_0104 antiserum. 

Table 30: Summary of FACS values for surface expression of SpyM3_0104 in an M12 serotype 



2728 


Pre-immune 


Anti-spyM3 0104 


Change 


198.57 


288.75 


90 



20 Figure 84 provides the results of FACS analysis for surface expression of SPs_0 1 06 on each 

of GAS serotypes M3 2721 and M3 3 135. A shift in fluorescence is observed for each GAS serotype 
when anti-SPsJ)106 antiserum is present, demonstrating cell surface expression. Table 3 1, below, 
quantitatively summarizes the FACS fluorescence values obtained for each GAS serotype in the 
presence of pre-immune antiserum, anti-SPs_0106 antiserum, and the difference in fluorescence value 

25 between the pre-immune and anti-SPs_0106 antiserum. 



Table 3 1 : Summary of FACS values for surface expression of SPsJ) 1 06 in M3 serotypes 



2721 


3135 


Pre-immune 


Anti-SPs 0106 


Change 


Pre-immune 


Anti-SPs 0106 


Change 


116 


463.28 


347 


103.02 


494.27 


391 



Figure 84 also provides the results of FACS analysis for surface expression of a pilin antigen 
that has homology to SPsJ)106 identified in a different GAS serotype, M12. FACS analysis 
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cKiJ^^ able to detect surface expression of the homologous 

SPs_0106 antigen on GAS serotype M12 2728. Table 32, below, quantitatively summarizes the 
FACS fluorescence values obtained for each GAS serotype in the presence of pre-immune antiserum, 
anti-SPs_0106 antiserum, and the difference in fluorescence value between the pre-immune and anti- 
5 SPs_0 106 antiserum. 

Table 32: Summary of FACS values for surface expression of SPs_0106 in an M12 serotype 



2728 


Pre-immune 


Anti-SPs 0106 


Change 


304.01 


254.64 


-49 



(4) Adhesin Island sequence within Ml 2: GAS Adhesin Island 4 ("GAS AI-4") 
GAS Adhesin Island sequences within M12 serotype are outlined in Table 1 1 below. This 
10 GAS adhesin island 4 ("GAS AI-4") comprises surface proteins, a SrtC2 sortase, and a RofA 
regulatory protein. 

GAS AI-4 surface proteins within may include a fimbrial protein, an F or F2 like fibronectin- 
binding protein, and a capsular polysaccharide adhesion protein (Cpa). GAS AI-4 surface proteins 
may also include a hypothetical surface protein in an open reading frame (orf). Preferably, each of 
15 these GAS AI-4 surface proteins include an LPXTG sortase substrate motif, such as LPXTG (SEQ ID 
NO: 122), VPXTG (SEQ ID NO: 137), QVXTG (SEQ ID NO: 138) or LPXAG (SEQ ID NO: 139). 

GAS AI-4 includes a SrtC2 type sortase. GAS SrtC2 type sortases may preferably anchor 
surface proteins with a QVPTG (SEQ ID NO: 140) motif, particularly when the motif is followed by a 
hydrophobic region and a charged C terminus tail. 
20 GAS AI-4 may also include a LepA putative signal peptidase I protein and a MsmRL protein. 



Table 11: GAS AI-4 sequences from M12 isolate (A735) 



AI-4 sequence 
identifier 


Sortase substrate 
sequence or sortase 
type 


Functional description 


19224133 




RofA regulatory protein 


19224134 


LPXTG 


protein F 




SrtB 


SrtB (stop co don*) 


19224135 


VPXTG 


Cpa 


19224136 




LepA 


19224137 


QVXTG 


EftLSL.A (fimbrial) 


19224138 


SrtC2 


EftLSL.B 


19224139 


LPXAG 


Orf2 


19224140 




MsmRL 


19224141 


LPXTG 


protein F2 



A schematic of AI-4 serotype M12 is shown in Figure 51A. 

One of the open reading frames encodes a SrtC2-type sortase having an amino acid sequence 
25 nearly identical to the amino acid sequence of the SrtC2-type sortase of the AI-3 serotypes described 
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afdvE le^M^jrS Slfe'r' JPaZiBa aliSlequence alignment for each of the SrtC2 amino acid 
sequences. 

Other proteins encoded by the open reading frames of the AI-4 serotype Ml 2 are homologous 
to proteins encoded by other known adhesin islands in S. pyogenes, as well as the GAS AI-3 serotype 
5 M5 (Manfredo). Figure 52 is an amino acid alignment of the capsular polysaccharide adhesion 

protein (cpa) of AI-4 serotype M12 (19224135), GAS AI-3 serotype M5 (ORF78), S. pyogenes strain 
MGAS315 serotype M3 (21909634), S. pyogenes SSI-1 serotype M3 (28810257), S. pyogenes 
MGAS8232 serotype M3 (19745301), and GAS AI-2 serotype Ml (GAS 15). The amino acid 
sequence of the AI-4 serotype M12 cpa shares a high degree of homology with other cpa proteins. 

10 Figure 53 shows that the F-like fibronectin-binding protein encoded by the AI-4 serotype 

M12 open reading frame (19224134) shares homology with a F-like fibronectin-binding protein found 
in S. pyogenes strain MGAS 10394 serotype M6 (50913503). 

Figure 54 is an amino acid sequence alignment that illustrates that the F2-like fibronectin- 
binding protein of AI-4 serotype M12 (19224141) shares homology with the F2-like fibronectin- 

15 binding protein of S. pyogenes strain MGAS8232 serotype M3 (19745307), GAS AI-3 serotype M5 

(ORF84), S. pyogenes strain SSI serotype M3 (28810263), and S. pyogenes strain MGAS315 serotype 
M3 (21909640). 

Figure 55 is an amino acid sequence alignment that illustrates that the fimbrial protein of AI-4 
serotype M12 (19224137) shares homology with the fimbrial protein of GAS AI-3 serotype M5 

20 (ORF80), and the hypothetical protein of S. pyogenes strain MGAS3 15 serotype M3 (21909636), S. 
pyogenes strain SSI serotype M3 (288 10259), S. pyogenes strain MGAS8732 serotype M3 
(19745303), and S. pyogenes strain Ml GAS serotype Ml (13621428). 

Figure 56 is an amino acid sequence alignment that illustrates that the hypothetical protein of 
GAS AI-4 serotype M12 (19224139) shares homology with the hypothetical protein of S. pyogenes 

25 strain MGAS315 serotype M3 (21909638), S. pyogenes strain SSI-1 serotype M3 (28810261), GAS 
AI-3 serotype M5 (ORF82), and S. pyogenes strain MGAS8232 serotype M3 (19745305). 

The protein F2-like fibronectin-binding protein of the type 4 adhesin island also contains a 
highly conserved pilin motif and an E-box. Figure 60 indicates the amino acid sequence of the pilin 
motif and E-box in AI-4 serotype Ml 2. 

30 FACS analysis has confirmed that the GAS AI-4 surface proteins 19224134, 19224135, 

19224137, and 19224141 are expressed on the surface of GAS. Figure 85 provides the results of 
FACS analysis for surface expression of 19224134 on GAS serotype M12 2728. A shift in 
fluorescence is observed when anti-19224134 antiserum is present, demonstrating cell surface 
expression. Table 33, below, quantitatively summarizes the FACS fluorescence values obtained for 

35 GAS serotype M12 2728 in the presence of pre-immune antiserum, anti-19224134 antiserum, and the 

difference in fluorescence value between the pre-immune and anti-19224134 antiserum. 

Table 33: Summary of FACS values for surface expression of 19224134 in an M12 serotype 

| 2728 I 
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C'TV USDS 




9tnti-19224134 


Change 


137.8 


485.32 


348 



Figure 85 also provides the results of FACS analysis for surface expression of a pilin antigen 
that has homology to 19224134 identified in a different GAS serotype, M6. FACS analysis conducted 
with the 19224134 antisera was able to detect surface expression of the homologous 19224134 
antigen on each of GAS serotypes M6 2724, M6 3650, and M6 2894. Table 34, below, quantitatively 
summarizes the FACS fluorescence values obtained for each GAS serotype in the presence of pre- 
immune antiserum, anti- 19224 134 antiserum, and the difference in fluorescence value between the 
pre-immune and anti-19224134 antiserum. 

Table 34: Summary of FACS values for surface expression of 19224134 in M6 serotypes 



2724 


3650 


2894 




Pre- 
immune 


Anti- 
19224134 


Change 


Pre- 
immune 


Anti- 
19224134 


Change 


Pre- 
immune 


Anti- 
19224134 


Change 


123,58 


264.59 


141 


140.82 


262.64 


122 


135.4 


307.25 


172 



Figure 86 provides the results of FACS analysis for surface expression of 19224135 on GAS 
serotype M12 2728. A shift in fluorescence is observed when anti-19224135 antiserum is present, 
demonstrating cell surface expression. Table 35, below, quantitatively summarizes the FACS 
fluorescence values obtained for GAS serotype M12 2728 in the presence of pre-immune antiserum, 
anti-19224135 antiserum, and the difference in fluorescence value between the pre-immune and anti- 
19224135 antiserum. 

Table 35: Summary of FACS values for surface expression of 19224135 in an M12 serotype 



2728 


Pre-immune 


Anti-19224135 


Change 


151.38 


471.95 


321 



Figure 87 provides the results of FACS analysis for surface expression of 19224137 on GAS 
serotype M12 2728. A shift in fluorescence is observed when anti-19224137 antiserum is present, 
demonstrating cell surface expression. Table 36, below, quantitatively summarizes the FACS 
fluorescence values obtained for GAS serotype Ml 2 2728 in the presence of pre-immune antiserum, 
anti-19224137 antiserum, and the difference in fluorescence value between the pre-immune and anti- 
19224137 antiserum. 

Table 36: Summary of FACS values for surface expression of 19224137 in an M12 serotype 



2728 


Pre-immune 


Anti-19224137 


Change 


140.44 


433.25 


293 



Figure 88 provides the results of FACS analysis for surface expression of 19224141 on GAS 
serotype M12 2728. A shift in fluorescence is observed when anti-19224141 antiserum is present, 
demonstrating cell surface expression. Table 37, below, quantitatively summarizes the FACS 
fluorescence values obtained for GAS serotype Ml 2 2728 in the presence of pre-immune antiserum, 
anti-19224141 antiserum, and the difference in fluorescence value between the pre-immune and anti- 
19224141 antiserum. 
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p ffalle.^ surface expression of 19224141 in an M12 serotype 



2728 


Pre-immune 


Anti-19224141 


Change 


147.02 


498 


351 



19224139 (designated as orf2) may also be expressed on the surface of GAS serotype Ml 2 
bacteria. Figure 175 shows the results of FACS analysis for surface expression of 19224139 on M12 
strain 2728. A slight shift in fluorescence is observed, which demonstrates that some 19224139 may 
5 be expressed on the GAS cell surface. 

Surface expression of 19224135 on M12 serotype GAS has also been confirmed by Western 
blot analysis. Figure 99 shows that while pre-immune sera (P a-4135) does not detect GAS Ml 2 
expression of 19224135, anti-19224135 immune sera (I a-4135) is able to detect 19224135 protein in 
both total GAS M12 extracts (M12 tot) and GAS M12 fractions enriched for cell surface proteins 

10 (M12 surf prot). The 19224135 proteins detected in the total GAS M12 extracts or the GAS M12 

extracts enriched for surface proteins are also present as high molecular weight structures, indicating 
that 19224135 maybe in an oligomeric (pilus) form. See also Figure 108, which provides a further 
Western blot showing that anti-19224135 antiserum (Anti-19224135) immunoreacts with high 
molecular weight structures in GAS M12 strain 2728 protein extracts enriched for surface proteins. 

15 Surface expression of 19224137 on Ml 2 serotype GAS has also been confirmed by Western 

blot analysis. Figure 100 shows that while pre-immune sera (P a-4137) does not detect GAS M12 
expression of 19224137, anti-19224137 immune sera (I a-4137) is able to detect 19224137 protein in 
both total GAS Ml 2 extracts (Ml 2 tot) and GAS M12 fractions enriched for cell surface proteins 
(M12 surf prot). The 19224137 proteins detected in the total GAS M12 extracts or the GAS M12 

20 extracts enriched for surface proteins are also present as high molecular weight structures, indicating 
that 19224137 may be in an oligomeric (pilus) form. See also Figure 108, which provides a further 
Western blot showing that anti-19224137 antiserum (Anti-19224137) immunoreacts with high 
molecular weight structures in GAS M12 strain 2728 protein extracts enriched for surface proteins. 
Streptococcus pneumoniae 

25 Adhesin island sequences can be identified in Streptococcus pneumoniae genomes. Several 

of these genomes include the publicly available Streptococcus pneumoniae TIGR4 genome or 
Streptococcus pneumoniae strain 670 genome. Examples of these S. pneumoniae AI sequence are 
discussed below. 

S. pneumoniae Adhesin Islands generally include a series of open reading frames within a S. 
30 pneumoniae genome that encode for a collection of surface proteins and sortases. A S. pneumoniae 
Adhesin Island may encode for amino acid sequences comprising at least one surface protein. 
Alternatively, an S. pneumoniae Adhesin Island may encode for at least two surface proteins and at 
least one sortase. Preferably, a S. pneumoniae Adhesin Island encodes for at least three surface 
proteins and at least two sortases. One or more of the surface proteins may include an LPXTG motif 
35 (such as LPXTG (SEQ ID NO: 122)) or other sortase substrate motif. One or more S. pneumoniae AI 
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sIMew]^ of a pilus structure on the surface of the S. 

pneumoniae bacteria. 

S. pneumoniae Adhesin Islands of the invention preferably include a divergently transcribed 
transcriptional regulator. The transcriptional regulator may regulate the expression of the S. 
pneumoniae AI operon. 

The S. pneumoniae AI surface proteins may bind or otherwise adhere to fibrinogen, 
fibronectin, or collagen. 

A schematic of the organization of a S. pneumoniae AI locus is provided in Figure 137. The 
locus comprises open reading frames encoding a transcriptional regulator (rlrA), cell wall surface 
proteins (rrgA, rrgB, rrgC), and sortases (srtB, srtC, srtD). Figure 137 also indicates the S. 
pneumoniae strain TIGR4 gene name corresponding to each of these open reading reading frames. 

Tables 9 and 38 identify the genomic location of each of these open reading frames in S. 
pneumoniae strains TIGR4 and 670, respectively. 

Table 9: S. pneumoniae AI sequences from TIGR4 



Genomic Location 


Strand 


Length 


PID 


Synonym (AI Sequence 
Identifier) 


Functional description 


436302..437831 




509 


15900377 


SP0461 


transcriptional regulator 


438326..441007 


+ 


893 


15900378 


SP0462 


cell wall surface anchor family protein 


441231. .443228 


+ 


665 


15900379 


SP0463 


cell wall surface anchor family protein 


443275..444456 


+ 


393 


15900380 


SP0464 


cell wall surface anchor family protein 


444675..444806 




43 


15900381 


SP0465 


hypothetical protein 


444857..44S696 


+ 


279 


15900382 


SP0466 


sortase 


445791..446576 


+ 


261 


15900383 


SP0467 


sortase 


446563..447414 


+ 


283 


15900384 


SP0468 


sortase 



Genomic Location 


Strand 


AI Sequence 
Identifier\ 


Functional description 


4383-5645 




Orfl_670 


IS 1167, transposase 


5910-7439 




Orf2j570 


transcriptional regulator, putative 


7934-10606 




Orf3_670 


cell wall surface anchor family protein 


10839-12773 


+ 


Orf4_670 


cell wall surface anchor family protein 


12796-14001 




Orf5 670 


cell wall surface anchor family protein 


14327-15241 


+ 


Orf6 670 


sortase, putative 


15336-16121 


+ 


Orf7 670 


sortase, putative 


16108-16959 


+ 


Orf8_670 


sortase, putative 



The full-length nucleotide sequence of the S. pneumoniae strain 670 AI is also shown in 
Figure 101, as is its translated amino acid sequence. 

At least eight other S. pneumoniae strains contain an adhesin island locus described by the 
locus depicted in Figure 137. These strains were identified by an amplification analysis. The 
genomes of different S. pneumoniae strains were amplified with eleven separate sets of primers. The 
sequence of each of these primers is provided below in Table 41. 

Table 41: Sequences of primers used to amplify AI locus 



Primer 



Forward Primer Sequence 



Reverse Primer Sequence 
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Lti ::;r,H LB >/ id: si:;::. ..:;* 




1 


ACTTTCTAATGAGTTGTTT AGGCG 


AGCGACAAGCCACTGTATCATATT 


2 


CTGGTCGATAACTCCTTCAATCTT 


GTACGACAAAAGTGTGGCTTGTT 


3 


GAATGCGATATTCAGGACCAACTA 


ATCTCACTGAGTTAATCCGTTCAC 


4 


TGTATACAAGTGTGTCATTGCCAG 


CATCTTCACCTGTTCTCACATTTT 


5 


GCGGTCTTTAGTCTTCAAAAACA 


CAAGAGAAAAACACAGAGCCATAA 


6 


TTGCTTAAGTAAGAGAGAAAGGAGC 


CAGGAGTATAGTGTCCGCTTTCTT 


7 


GGC A ATGTTG ACTTT ATG A AGG TG 


TA 1 C AOCA 1 l-UL 1 1 1A11/1 1 


8 


TGAGATTTTCTCGTTTCTCTTAGC 


AATAGACGATGGGTATTGATCATGT 


9 


CCGACGAACTTTGATGATTTATTG 


ACCAACAGACGATGACTGTTAATC 


10 


AATGACTTTGAGCCTGTCTTGAT 


TTCTACAATTTCCTGGCCATTATC 


11 


GCCATTTGGATCAGCTAAAAGTT 


TTTTTCAACCCACTACAGTTGACA 



These primers hybridized along the entire length of the AI locus to generate amplification products 
representative of sequences throughout the locus. See Figure 138, which is a schematic of the 
location where each of these primers hybridizes to the S. pneumoniae AI locus. Figure 139A provides 
5 the set of amplicons obtained from amplification of the AI locus in S. pneumoniae strain TIGR4. 
Figure 139B provides the length, in base pairs, of each amplicon in S. pneumoniae strain TIGR4. 
Amplification of the genome of S. pneumoniae strains 19A Hungary 6, 6B Finland 12, 6B Spain 2, 9V 
Spain 3, 14 CSR 10, 19F Taiwan 14, 23F Taiwan 15, and 23F Poland 16 produced a set of eleven 
amplicons for the eleven primer pairs, indicating that each of these strains also contained the S. 

10 pneumoniae AI locus. 

The S. pneumoniae strains were also identified as containing the AI locus by comparative 
genome hybridization (CGH) analysis. The genomes of sixteen 51 pneumoniae strains were 
interrogated for the presence of the AI locus by comparison to unique open reading frames of strain 
TIGR4. The AI locus was detected by this method in strains 19A Hungary 6 (19AHUN), 6B Finland 

15 12 (6BFIN12), 6B Spain 2 (6BSP2), 14CSR10 (14 CSR10), 9V Spain 3 (9VSP3), 19F Taiwan 14 
(19FTW14), 23F Taiwan 15 (19FTW15), and 23F Poland 16 (23FP16). See Figure 140. 

The AI locus has been sequenced for each of these strains and the nucleotide and encoded 
amino acid seqeunce for each orf has been determined. An alignment of the complete nucleotide 
sequence of the adhesin island present in each of the ten strains is provided in Figure 196. Aligning 

20 the amino acid sequences encoded by the orfs reveals conservation of many of the AI polypeptide 

amino acid sequences. For example, Table 39 provides a comparison of the percent identities of the 
polypeptides encoded within the £. pneumoniae strain 670 and TIGR4 adhesin islands. 



Table 39: Pecent identity comparison of S. pneumoniae strains AI sequences 



S. pneumoniae strain 670 
polypeptide 


S. pneumoniae from TIGR4 
polypeptide 


Shared identity of polypeptides 


Orfl 670 


SP0460 


99.3% identity in 422 aa overlap 


Orf2 670 


SP0461 


100.0% identity in 509 aa overlap 


OrB 670 


SP0462 


83.2% identity in 895 aa overlap 


Orf4 670 


SP0463 


47.9% identity in 678 aa overlap 


Orf5 670 


SP0464 


99.7% identity in 393 aa overlap 


Orf6_670 


SP0466 


100.0% identity in 279 aa overlap 


Orf7 670 


SP0467 


94.2% identity in 260 aa overlap 


Orf8 670 


SP0468 


91.5% identity in 283 aa overlap 
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Figures 141-147 each provide a multiple sequence alignment for the polypeptides encoded by one of 
the open reading frames in ail ten Al-positive S. pneumoniae strains. In each of the sequence 
alignments, light shading indicates an LPXTG motif and dark shading indicates the presence of an E- 
box motif with the conserved glutamic acid residue of the E-box motif in bold. 

The sequence alignments also revealed that the polypeptides encoded by most of the open 
reading frames may be divided into two groups of homology, S. pneumoniae Al-a and Al-b. S. 
pneumoniae strains that comprise Al-a include 14 CSR 10, 19A Hungary 6, 23F Poland 15, 670, 6B 
Finland 12, and 6B Spain 2. S. pneumoniae strains that comprise Al-b include 19F Taiwan 14, 9V 
Spain 3, 23F Taiwan 15, and TIGR4. An immunogenic composition of the invention may comprise 
one or more polypeptides from within each of S. pneumoniae Al-a and Al-b. For example, 
polypeptide RrgB, encoded by open reading frame 4, may be divided within two such groups of 
homology. One group contains the RrgB sequences of six S. pneumoniae strains and a second group 
contains the RrgB sequences of four $. pneumoniae strains. While the amino acid sequence of the 
strains within each individual group is 99-100 percent identical, the amino acid sequence identity of 
the strains in the first relative to the second group is only 48%. Table 41 provides the identity 
comparisons of the amino acid sequences encoded by each open reading frame for the ten S. 
pneumoniae strains. 



Table 42: Conservation of amino acid sequences encoded by the S. pneumoniae AI locus 



Putative Role of Polypeptide 


Encoded 
by Orf 


Groups of 
Homology 


% Identity in 
Group 


% Identity 
Between 
Groups 


RlrA, transcriptional regulator 


2 


1 group (10 strains) 


100 




RrgA, cell wall surface protein 


3 


2 groups (6 4- 4) 


98-100 


83 


RrgB, cell wall surface protein 


4 


2 groups (6 + 4) 


99-100 


48 


RrgC, cell wall surface protein 


5 


2 groups (6 + 4) 


99-100 


97 


SrtB, putative sortase 


6 


2 groups (7 + 3) 


99-100 


97 


SrtC, putative sortase 


7 


2 groups (6 + 4) 


[ 95-100 


93 


SrtD, putative sortase 


8 


2 groups (6 + 4) 


99-100 


92 



The division of homology between the RrgB polypeptide in the S. pneumoniae strains is due a 



lack of amino acid sequence identity in the central amino acid residues. Amino acid residues 1-30 
and 617-665 are identical for each of the ten S. pneumoniae strains. However, amino acid residues 
3 1-616 share between 42 and 100 percent identity between strains. See Figure 149. The shared N- 
and C-terminal regions of identity in the RrgB polypeptides may be preferred portions of the RrgB 
polypeptide for use in an immunogenic composition. Similarly, shared regions of identity in any of 
the polypeptides encoded by the S. pneumoniae AI locus may be preferable for use in immunogenic 
compositions. One of skill in the art, using the amino acid alignments provided in Figures 141-147, 
would readily be able to determine these regions of identity. 

The S. pneumoniae comprising these AI loci do, in fact, express high molecular weight 
polymers on their surface, indicating the presence of pili. See Figure 182, which shows detection of 
high molecular weight structures expressed by S. pneumoniae strains that comprise the adhesin island 
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